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METHODS FOR THE PRODUCTION OF MULTIMERIC PROTEINS, 
AND RELATED COMPOSITIONS 

RELATED APPLICATIONS 

Benefit of priority under 35 U.S.C, §1 19(e) is claimed to. U.S. provisional 
5 application Serial No. 60/302,885, filed July 5, 2001, to van Rooijen, etal., 
entitled "METHODS FOR THE PRODUCTION OF REDOX PROTEINS". This 
application is also a continuation-in-part of U.S. utility application Serial No. 
10/006,038, filed December 4, 2001 to van Rooijen, et a/. r entitled "METHODS 
FOR THE PRODUCTION OF REDOX PROTEINS"; which is a continuation-in-part 

10 of U.S. utility application Serial No. 09/742,900, filed December 19, 2000 to 
Heifetz, et aL, entitled "METHOD OF PRODUCTION AND DELIVERY OF 
THIOREDOXIN". This application is also a continuation-in-part of U.S. utility 
application Serial No. 09/742,900. The subject matter of each of the provisional 
and utility applications is incorporated herein by reference in its entirety. 

1 5 Field Of The Invention 

The present invention relates to multimeric-protein-complexes, redox 
proteins, and recombinant polypeptides; and improved methods for their 
production. 
BACKGROUND 

20 Multimeric proteins {i.e. proteins comprising multiple polypeptide chains) 

are a biologically and commercially important class of proteins. Antibodies for 
example are multimeric proteins which are used to treat a wide range of disease 
conditions. However in view of their complexity, multimeric proteins frequently 
represent significant manufacturing challenges. 

25 Redox proteins are also a commercially important class of proteins with 

applications in a variety of different industries including the pharmaceutical, 
personal care and food industry. For example, the redox protein thioredoxin may 
be used in the manufacture of personal care products (Japanese Patent 
Applications JP9012471A2, JP103743A2, JP1 1 29785A2), pharmaceutical 

30 compositions/products (Aota et al. (1996) J. Cardiov. Pharmacol. (1996) 27: 

727-732) as well as to reduce protein allergens present in food products such as 
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milk (del Val et al. (1999) J. Allerg. Vlin. Immunol. 103: 690-697) and wheat 
(Buchanan et al. (1997) Proc. Natl. Acad. Sci. USA 94: 5372-5377). 

However, there is a need in the art to further improve the methods for the 
recombinant expression of multimeric proteins, including redox proteins. The 
5 present invention satisfies this need and provides related advantages as well. 
SUMMARY OF THE INVENTION 

The present invention relates to novel and improved methods of 
producing a first and/or second recombinant polypeptides, multimeric-protein- 
complexes, heteromultimeric-protein-complexes, multimeric-fusion-proteins, 

10 heteromultimeric-fusion-proteins, immunoglobulin-polypeptide-chains, 

immunoglobulins, redox-fusion-polypeptides, and/or thioredoxin-related proteins; 
in association with oil bodies. Accordingly, provided herein are methods of 
producing a recombinant multimeric-protein-complex, said method comprising: 
(a) producing in a cell comprising oil bodies, a first recombinant polypeptide and 

15 a second recombinant polypeptide wherein said first recombinant polypeptide is 
capable of associating with said second recombinant polypeptide to form said 
multimeric-protein-complex; and (b) associating said multimeric-protein-complex 
with an oil body through an oil-body-targeting-protein capable of associating with 
said oil bodies and said first recombinant polypeptide. 

20 The method further contemplates isolating the oil bodies associated with 

said recombinant multimeric-protein-complex. The second recombinant 
polypeptide can be associated with a second oil-body-targeting-protein capable 
of associating with an oil body and said second recombinant polypeptide. Each 
of said oil-body-targeting-proteins can be an oil-body-protein or an 

25 immunoglobulin. The oil-body-targeting-protein can be an oleosin or caleosin. 
When the oil-body-targeting-protein can be an oleosin or caleosin, the first 
recombinant polypeptide can be fused to said oleosin or caleosin. Likewise, the 
second recombinant polypeptide can be fused to a second oleosin or second 
caleosin capable of associating with an oil body. The first and second 

30 recombinant polypeptides can be produced as a multimereic-fusion-protein 

comprising said first and second polypeptide, and can form a multimeric-protein- 
complex. The multimeric-protein-complex can be a heteromultimeric-protein- 
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complex, and the heteromultimeric-protein-complex can be an enzymatically 
active redox complex or an immunoglobulin. In one embodiment, the first 
recombinant polypeptide is capable of associating with said second recombinant 
polypeptide in the cell. In another embodiment, the first recombinant 
5 polypeptide can be a thioredoxin and the second recombinant polypeptide can be 
a thioredoxin-reductase. In particular embodiments, the thioredoxin can be 
selected from the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID 
NOs:52-1 94; and the thioredoxin-reductase can be selected from the group 
consisting of those set forth in SEQ ID NOs:8, 9, 10, 40, 44, 48, 50 and SEQ ID 

10 NOs:195-31 3. In another embodiment, the first recombinant polypeptide can be 
an immunoglobulin-polypeptide-chain. For example, the first recombinant 
polypeptide can be an immunoglobulin light chain, or an immunologically active 
portion thereof, and the second recombinant polypeptide can be an 
immunoglobulin heavy chain, or an immunologically active portion thereof- In 

15 this embodiment, the oil-body-targeting-protein can comprise protein A, protein L 
or protein Q. The cell can be a plant cell, such as a safflower cell, and the like. 

Also provided herein is a method of expressing a recombinant multimeric- 
protein-complex comprising a first and second recombinant polypeptide in a cell, 
said method comprising: 

20 (a) introducing into a cell a first chimeric nucleic acid sequence comprising: 

(i) a first nucleic acid sequence capable of regulating transcription 
in said cell operatively linked to; 

(ii) a second nucleic acid sequence encoding a first recombinant 

polypeptide; 

25 (b) introducing into said cell a second chimeric nucleic acid sequence comprising: 

(i) a third nucleic acid sequence capable of regulating transcription 
in said cell operatively linked to; 

(ii) a fourth nucleic acid sequence encoding a second recombinant 

polypeptide; 

30 (c) growing said cell under conditions to permit expression of said first-and 

second recombinant polypeptide in a progeny cell comprising oil bodies wherein 
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said first recombinant polypeptide and said second recombinant polypeptide are 
capable of forming a multimeric-protein-complex; and 

(d) associating said first recombinant polypeptide with an oil body through an oil- 
body-targeting-protein capable of associating with said oil bodies and said first 
5 recombinant polypeptide. This method further contemplates isolating from the 
progeny cell, oil bodies comprising the multimeric-protein-complex. The second 
recombinant polypeptide can be associated with a second oil-body-targeting- 
protein capable of associating with an oil body and second recombinant 
polypeptide. Each of said oil-body-targeting-proteins can be an oil-body-protein 

10 or an immunoglobulin. The oil-body-targeting-protein can be an oleosin or 

caleosin. When the oil-body-targeting-protein is an oleosin or caleosin, the first 
recombinant polypeptide can be fused to said oleosin or caleosin. Likewise, the 
second recombinant polypeptide can be fused to a second oleosin or second 
caleosin capable of associating with an oil body. The first and second 

1 5 recombinant polypeptides can be produced as a multimereic-fusion-protein 

comprising said first and second polypeptide, and can form a multimeric-protein- 
complex. The multimeric-protein-complex can be a heteromultimeric-protein- 
complex, and the heteromultimeric-protein-complex can be an enzymatically 
active redox complex or an immunoglobulin. In one embodiment the first 

20 recombinant polypeptide and said second recombinant polypeptide are capable of 
forming a multimeric-protein-complex in said progeny cell. In another 
embodiment, the first recombinant polypeptide can be a thioredoxin and the 
second recombinant polypeptide can be a thioredoxin-reductase. In particular 
embodiments, the thioredoxin can be selected from the group consisting of SEQ 

25 ID NOs:38, 42, 46, 50 and SEQ ID NOs:52-194; and the thioredoxin-reductase 
can be selected from the group consisting of those set forth in SEQ ID NOs:8, 9, 
10, 40, 44, 48, 50 and SEQ ID NOs: 195-31 3. In another embodiment, the first 
recombinant polypeptide can be an immunoglobulin-polypeptide-chain. For 
example, the first recombinant polypeptide can be an immunoglobulin light chain, 

30 or an immunologically active portion thereof, and the second recombinant 

polypeptide can be an immunoglobulin heavy chain, or an immunologically active 
portion thereof. In this embodiment, the oil-body-targeting-protein can comprise 
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protein A, protein L or protein G. The cell can be a plant cell, such as a 
safflower cell, and the like. 

Also provided herein are methods of producing in a plant a recombinant 
multimeric-protein-complex, said method comprising: 
5 (a) preparing a first plant comprising cells, said cells comprising oil bodies and a 
first recombinant polypeptide wherein said first recombinant polypeptide is 
capable of associating with said oil bodies through an oil-body-targeting-protein; 
(b) preparing a second plant comprising cells, said cells comprising oil bodies and 
a second recombinant polypeptide; and 

10 (c) sexually crossing said first plant with said second plant to produce a progeny 
plant comprising cells, said cells comprising oil bodies, wherein said oil bodies 
are capable of associating with said first recombinant polypeptide, and said first 
recombinant recombinant polypeptide is capable of associating with said second 
recombinant polypeptide to form said recombinant multimeric-protein-complex. 

1 5 The second recombinant polypeptide can be associated with oil bodies through a 
second oil-body-targeting-protein in the second plant. The oil bodies can be 
isolated from the progeny plant comprising said multimeric-protein-complex. The 
oil-body-targeting-protein can be selected from an oil-body-protein or an 
immunoglobulin, wherein the oil-body-protein can be an oleosin or caleosin. The 

20 first recombinant polypeptide can be fused to the oleosin or caleosin; and the 
second recombinant polypeptide can be fused to a second oleosin or second 
caleosin capable of associating with an oil body. The first and second 
recombinant polypeptide can form a multimeric-protein-complex, such as a 
heteromultimeric-protein-complex, wherein the heteromultimeric-protein-complex 

25 can be an enzymatically active redox complex or an immunoglobulin. In a 

particular embodiment, the first recombinant polypeptide is a thioredoxin and the 
second recombinant polypeptide is a thioredoxin-reductase. The thioredoxin can 
be selected from the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID 
NOs:52-194; and the thioredoxin-reductase can be selected from the group 

30 consisting of those set forth in SEQ ID NOs:8, 9, 10, 40, 44, 48, 50 and SEQ ID 
NOs:195-313. In another embodiment, the first recombinant polypeptide can be 
an immunoglobulin-polypeptide-chain. For example, the first recombinant 
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polypeptide can be an immunoglobulin light chain, or an immunologically active 
portion thereof, and the second recombinant polypeptide can be an 
immunoglobulin heavy chain, or an immunologically active portion thereof. In 
this embodiment, the oil-body-targeting-protein can comprise protein A, protein L 
5 or protein G. The plant can be a safflower plant. 

Also provided herein are chimeric nucleic acids encoding a multimeric- 
fusion-protein as described herein, said nucleic acid comprising: 
(a) a first nucleic acid sequence encoding an oil-body-targeting-protein 
operatively linked in reading frame to; 
10 (b) a second nucleic acid sequence encoding a first recombinant polypeptide; 
linked in reading frame to; 

(c) a third nucleic acid sequence encoding a second recombinant polypeptide, 
wherein said first and second recombinant polypeptide are capable of forming a 
multimeric-protein-complex. The oil-body-targeting-protein can be selected from 

15 an oil-body-protein or an immunoglobulin. The oil-body-protein can be an oleosin 
or caleosin. The multimeric-protein-complex can be a heteromultimeric-protein- 
complex, and the first and second recombinant polypeptide can form an 
enzymatically active heteromultimeric redox complex or an immunoglobulin. In a 
particular embodiment, the first recombinant polypeptide is a thioredoxin and the 

20 second recombinant polypeptide is a thioredoxin-reductase. The thioredoxin can 
be selected from the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID 
NOs:52-194; and the thioredoxin-reductase can be selected from the group 
consisting of those set forth in SEQ ID NOs:8, 9, 10, 40, 44, 48, 50 and SEQ ID 
NOs:195-313. In another embodiment, the first recombinant polypeptide can be 

25 an immunoglobulin-polypeptide-chain. For example, the first recombinant 

polypeptide can be an immunoglobulin light chain, or an immunologically active 
portion thereof, and the second recombinant polypeptide can be an 
immunoglobulin heavy chain, or an immunologically active portion thereof. In 
this embodiment, the oil-body-targeting-protein can comprise protein A, protein L 

30 or protein G. In yet another embodiment, positioned between the nucleic acid 
sequence encoding an oil-body-targeting-protein and the nucleic acid sequence 
encoding a first recombinant polypeptide can be a linker nucleic acid sequence 
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encoding an oil-body-surface-avoiding linker amino acid sequence. The oil-body- 
surface-avoiding linker amino acid sequence can be substantially negatively 
charged, or have a molecular weight of at least 35 kd. Optionally, the gene 
fusion further comprises a linker nucleic acid sequence encoding an amino acid 
5 sequence that is specifically cleavable by an enzyme or a chemical, wherein the 
linker sequence is positioned between the oil-body-surface-avoiding linker amino 
acid sequence that is also a non-proteolytic linker and said sequence encoding 
the first recombinant polypeptide. 

Also provided herein are recombinant multimeric-fusion-proteins 

10 comprising <i) an oil-body-targeting-protein, or fragment thereof, (ii) a first 
recombinant polypeptide and a (iii) second recombinant polypeptide, wherein 
said first and second recombinant polypeptides are capable of forming a 
multimeric-protein-complex. The oil-body-targeting-protein can be selected from 
an oil-body-protein or an immunoglobulin, and the oil-body-protein can be an 

15 oleosin or a caleosin. The multim eric-fusion-protein can be a heteromultimeric- 
fusion-protein, wherein said first and second recombinant polypeptide form an 
enzymatically active heteromultimeric redox complex or an immunoglobulin. In a 
particular embodiment, the first recombinant polypeptide is a thioredoxin and the 
second recombinant polypeptide is a thioredoxin-reductase. The thioredoxin can 

20 be selected from the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID 
NOs:52-194; and the thioredoxin-reductase can be selected from the group 
consisting of those set forth in SEQ ID NOs:8, 9, 10, 40, 44, 48, 50 and SEQ ID 
NOs:195-313. In another embodiment, the first recombinant polypeptide can be 
an immunoglobulin-polypeptide-chain. For example, the first recombinant 

25 polypeptide can be an immunoglobulin light chain, or an immunologically active 
portion thereof, and the second recombinant polypeptide can be an 
immunoglobulin heavy chain, or an immunologically active portion thereof. In 
this embodiment, the oil-body-targeting-protein can comprise protein A, protein L 
or protein G. In yet another embodiment, positioned between the nucleic acid 

30 sequence encoding an oil-body-targeting-protein and the nucleic acid sequence 
encoding a first recombinant polypeptide can be a linker nucleic acid sequence 
encoding an oil-body-surface-avoiding linker amino acid sequence. The oil-body- 
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surface-avoiding linker amino acid sequence can be substantially negatively 
charged, or have a molecular weight of at least 35 kd. Optionally, the gene 
fusion further comprises a linker nucleic acid sequence encoding an amino acid 
sequence that is specifically cleavable by an enzyme or a chemical, wherein the 
5 linker sequence is positioned between the oil-body-surface-avoiding linker amino 
acid sequence and said sequence encoding the first recombinant polypeptide. 

Also provided herein are isolated oil bodies comprising a multimeric- 
protein-complex comprising (i) an oil-body-targeting-protein and (ii) a first 
recombinant polypeptide, said oil bodies further comprising a second 

10 recombinant polypeptide, wherein said first and second recombinant polypeptide 
are capable of forming a multimeric-protein-complex. The oil-body-targeting- 
protein can be selected from an oil-body-protein or an immunoglobulin, and the 
oil-body-protein can be an oleosin or a caleosin. The multimeric-fusion-protein 
can be a heteromultimeric-fusion-protein, wherein said first and second 

15 recombinant polypeptide form an enzymatically active heteromultimeric redox 
complex or an immunoglobulin. In a particular embodiment, the first 
recombinant polypeptide is a thioredoxin and the second recombinant 
polypeptide is a thioredoxin-reductase. In another embodiment, the first 
recombinant polypeptide can be an immunoglobulin-polypeptide-chain. For 

20 example, the first recombinant polypeptide can be an immunoglobulin light chain, 
or an immunologically active portion thereof, and the second recombinant 
polypeptide can be an immunoglobulin heavy chain, or an immunologically active 
portion thereof. In this embodiment, the oil-body-targeting-protein can comprise 
protein A, protein L or protein G. 

25 Also provided herein are isolated oil bodies comprising 

(a) a first fusion protein comprising a first oil-body-targeting-protein fused to a 
first recombinant polypeptide; and 

(b) a second fusion protein comprising a second oil-body-targeting-protein fused 
to a second recombinant polypeptide, 

30 wherein said first and second recombinant polypeptide are capable of forming a 
multimeric-protein-complex. The oil-body-targeting-protein can be selected from 
an oil-body-protein or an immunoglobulin, and the oil-body-protein can be an 
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oleosin or a caleosin. The multimeric-fusion-protein can be a heteromultimeric- 
fusion-protein, wherein said first and second recombinant polypeptide form an 
enzymatically active heteromultimeric redox complex or an immunoglobulin. In a 
particular embodiment, the first recombinant polypeptide is a thioredoxin and the 
5 second recombinant polypeptide is a thioredoxin-reductase. The thioredoxin can 
be selected from the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID 
NOs:52-194;. and the thioredoxin-reductase can be selected from the group 
consisting of those set forth in SEQ ID NOs:8, 9, 10, 40, 44, 48, 50 and SEQ ID 
NOs:195-313. In another embodiment, the first recombinant polypeptide can be 

10 an immunoglobulin-polypeptide-chain. For example, the first recombinant 

polypeptide can be an immunoglobulin light chain, or an immunologically active 
portion thereof, and the second recombinant polypeptide can be an 
immunoglobulin heavy chain, or an immunologically active portion thereof. In 
this embodiment, the oil-body-targeting-protein can comprise protein A, protein L 

15 or protein G. 

Also provided are cells and transgenic plants comprising oil bodies, 
multimeric-protein-complexes, and multimeric-fusion-proteins, set forth herein. 
In one embodiment, the first recombinant polypeptide can be an immunoglobulin- 
polypeptide-chain. For example, the first recombinant polypeptide can be an 

20 immunoglobulin light chain, or an immunologically active portion thereof, and the 
second recombinant polypeptide can be an immunoglobulin heavy chain, or an 
immunologically active portion thereof. In this embodiment, the oil-body- 
targeting-protein can comprise protein A, protein L or protein G. In 
embodiments, wherein said first recombinant polypeptide is a thioredoxin and 

25 said second recombinant polypeptide is a thioredoxin-reductase, the methods 
described herein can be used to formulate the oil bodies for use in the 
preparation of a food product, personal care product or pharmaceutical 
composition. These formulations can further comprise the addition of NADP or 
NADPH. The food product can be a milk or wheat based food product. The 

30 personal care product can reduce the oxidative stress to the surface area of the 
human body or can be used to lighten the skin. The pharmaceutical composition 
can be used to treat chronic obstructive pulmonary disease (COPD), cataracts, 
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diabetes, envenomation, bronchiopulmonary disease, malignancies, psoriasis, 
reperfusion injury, wound healing, sepsis, Gl bleeding, intestinal bowel disease 
(IBD), ulcers, GERD (gastro esophageal reflux disease). 

Also provided herein are compositions comprising isolated oil bodies, 
5 thioredoxin and thioredoxin-reductase, wherein said thioredoxin can be selected 
from the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID NOs:52- 
1 94, and said thioredoxin-reductase can be selected from the group consisting 
of those set forth in SEQ ID NOs:8, 9, 10, 40, 44, 48, 50 and SEQ ID NOs:195- 
313. The composition can further comprise NADP or NADPH. In another 

10 embodiment, the composition comprises a first recombinant polypeptide that can 
be an immunoglobulin-polypeptide-chain and a second recombinant polypeptide. 
For example, the first recombinant polypeptide can be an immunoglobulin light 
chain, or an immunologically active portion thereof, and the second recombinant 
polypeptide can be an immunoglobulin heavy chain, or an immunologically active 

15 portion thereof. In this embodiment, the oil-body-targeting-protein can comprise 
protein A, protein L or protein G. 

Also provided are multimeric-fusion-proteins, wherein the fusion-protein 
contains two or more polypeptide chains selected from the group of proteins set 
forth in Figure 5. Methods are also provided of reducing allergenicity of a food 

20 comprising the steps of providing the isolated oil bodies set forth herein; and 
adding the isolated oil bodies to the food, whereby allergenicity of the food is 
reduced. The food can be selected from the group consisting of wheat flour, 
wheat dough, milk, cheese, yogurt and ice cream. The various methods of 
treating food can further comprise providing NADH as a co-factor in the 

25 substantial absence of NADPH. 

Also provided herein are methods of treating or protecting a target 
against oxidative stress, comprising the steps of providing the recombinant 
redox fusion polypeptide comprising thioredoxin and thioredoxin-reductase; and 
contacting the recombinant fusion polypeptide with a target, wherein the target 

30 is susceptible to oxidative stress, thereby treating or protecting against the 
stress. The target can be selected from the group consisting of a molecule, a 
molecular complex, a cell, a tissue, and an organ. 
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Also provided herein are methods for preparing an enzymatically active 
redox protein associated with oil bodies comprising: 

a) producing in a cell a redox fusion polypeptide comprising a first 
redox protein linked to a second redox protein; 
5 b) associating said redox fusion polypeptide with oil bodies through 

an oil-body-targeting-protein capable of associating with said redox fusion 
polypeptide and said oil bodies; and 

c) isolating said oil bodies associated with said redox fusion 
polypeptide. The first redox protein can be a thioredoxin and the second redox 
10 protein can be a thioredoxin-reductase. 

Also, provided herein are methods of producing an immunoglobulin, said 
method comprising: (a) producing in a cell comprising oil bodies, a first 
immunoglobulin-polypeptide-chain and a second immunoglobulin-polypeptide- 
chain wherein said first immunoglobulin-polypeptide-chain is capable of 
1 5 associating with said second immunoglobulin-polypeptide-chain to form said 
immunoglobulin; and (b) associating said immunoglobulin with an oil body 
through an oil-body-targeting-protein capable of associating with said oil bodies 
and said first immunoglobulin-polypeptide-chain. For example, the first 
immunoglobulin-polypeptide-chain can be an immunoglobulin light chain, or an 
20 immunologically active portion thereof, and the second immunoglobulin- 

polypeptide-chain can be an immunoglobulin heavy chain, or an immunologically 
active portion thereof. In this embodiment, the oil-body-targeting-protein can 
comprise protein A, protein L or protein G. 

Also provided herein are methods for preparing a redox protein or an 
25 immunoglobulin associated with oil bodies comprising: 

a) introducing into a cell a chimeric nucleic acid sequence comprising: 

1 ) a first nucleic acid sequence capable of regulating 
transcription in said cell operatively linked to; 

2) a second nucleic acid sequence encoding a recombinant 
30 fusion polypeptide comprising (i) a nucleic acid sequence encoding 

a sufficient portion of an oil-body-protein to provide targeting of 
said recombinant fusion polypeptide to an oil body linked to (ii) a 
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nucleic acid sequence encoding a redox fusion polypeptide 
comprising a first redox protein linked to a second redox protein, 
or a nucleic acid sequence encoding a immunoglobulin comprising 
a first immunoglobulin-polypeptide-chain linked to a second 
5 immunoglobulin-polypeptide-chain, operatively linked to; 

3) a third nucleic acid sequence capable of terminating 
transcription in said cell; 

b) growing said cell under conditions to permit expression of said 
redox fusion polypeptide or immunoglobulin in a progeny cell comprising oil 

10 bodies; and 

c) isolating from said progeny cell said oil bodies comprising said 
redox fusion polypeptide or immunoglobulin. In certain embodiments, positioned 
between said nucleic acid sequence encoding a sufficient portion of an oil-body- 
protein and said nucleic acid sequence encoding a redox fusion polypeptide or 

15 immunoglobulin can be a linker nucleic acid sequence encoding an oil-body- 
surface-avoiding linker amino acid sequence. The oil-body-surface-avoiding 
linker amino acid sequence can be substantially negatively charged or have a 
molecular weight of at least 35 kd. Optionally, the gene fusion further 
comprises a linker nucleic acid sequence encoding an amino acid sequence that 

20 is specifically cleavable by an enzyme or a chemical, wherein the linker sequence 
is positioned between the oil-body-surface-avoiding linker amino acid sequence 
and said nucleic acid sequence encoding a redox fusion polypeptide. In this 
optional embodiment, also contemplated is the introduction of an enzyme or 
chemical that cleaves said redox fusion polypeptide from said oil body, thereby 

25 obtaining isolated redox fusion polypeptide. The first redox protein can be a 
thioredoxin and said second redox protein can be a thioredoxin-reductase. In 
one embodiment, the thioredoxin and thioredoxin-reductase can be obtained 
from Arabidopsis. In another embodiment, the first redox protein is at least 5 
times more active when produced as a redox fusion polypeptide as compared to 

30 the production of the first redox protein without the second redox prptein. 

Also provided herein, for use with the various methods set forth herein is 
the formulation of an emulsion of the oil bodies associated with the redox fusion 
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polypeptide for use in the preparation of a product capable of treating oxidative 
stress in a target, a product capable of chemically reducing a target, 
pharmaceutical composition, a personal care product or a food product. 
Accordingly, an emulsion formulation composition is provided. 
5 Also provided herein is a chimeric nucleic acid comprising: 

1 ) a first nucleic acid sequence capable of regulating transcription in 
a host cell operatively linked to; 

2) a second nucleic acid sequence encoding a recombinant fusion 
polypeptide comprising (i) a nucleic acid sequence encoding a sufficient portion 

10 of an oil-body-protein to provide targeting of said recombinant fusion polypeptide 
to an oil body linked to (ii) a nucleic acid sequence encoding a redox fusion 
polypeptide comprising a first redox protein linked to a second redox protein 
operatively linked to; 

3) a third nucleic acid sequence capable of terminating transcription 
15 in said cell. The oil-body-protein can be an oleosin or a caleosin, the first redox 

protein can be a thioredoxin and said second redox protein can be a thioredoxin- 
reductase. In certain embodiments, positioned between said nucleic acid 
sequence encoding a sufficient portion of an oil-body-protein and said nucleic 
acid sequence encoding a redox fusion polypeptide is a linker nucleic acid 

20 sequence encoding an oil-body-surface-avoiding linker amino acid sequence. 
The oil-body-surface-avoiding linker amino acid sequence can be substantially 
negatively charged, or have a molecular weight of at least 35 kd. In one 
embodiment, the gene fusion optionally further comprises a linker nucleic acid 
sequence encoding an amino acid sequence that is specifically cleavable by an 

25 enzyme or a chemical, wherein the linker sequence is positioned between the oil- 
body-surface-avoiding linker amino acid sequence and said nucleic acid sequence 
encoding a redox fusion polypeptide. 

Also provided herein are transgenic plants, e.g., saf flower plants, 
comprising any of the chimeric nucleic acid sequences and constructs described 

30 herein. The chimeric nucleic acids can be contained within a plastid., 

Accordingly, isolated plastids are provided having chimeric nucleic acids therein. 
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Also provided are plant seeds comprising the chimeric nucleic acids provided 
herein. 

Also provided are oil body preparations obtained using any of the 
methods provided herein, and food products, pharmaceutical compositions, and 
5 personal care products containing the oil body preparations. The products 

and/or compositions provided herein are capable of treating oxidative stress in a 
target, capable of chemically reducing a target. Also provided is a detergent 
composition comprising an oil body preparation capable of chemically reducing a 
target, and related methods of cleansing an item, comprising administering such 

10 product to the item under conditions that promote cleansing. 

Also provided herein are nucleic acid constructs comprising a gene fusion, 
wherein the gene fusion comprises a first region encoding an oil-body-protein or 
an active fragment thereof, operably linked to a second region encoding at least 
one thioredoxin-related protein or an active fragment thereof. In one 

15 embodiment, the at least one thioredoxin-related protein can be thioredoxin. The 
thioredoxin can be selected from the group consisting of SEQ ID NOs:38, 42, 
46, 50 and SEQ ID NOs:52-194. The thioredoxin can be obtained from 
Arabidopsis or wheat. 

In another embodiment, the at least one thioredoxin-related protein can be 

20 thioredoxin-reductase. The thioredoxin-reductase can be selected from the group 
consisting of those set forth in SEQ ID NOs:8, 9, 10, 40, 44, 48, 50 and SEQ ID 
NOs:195-313 and/or derived from Arabidopsis or wheat. The thioredoxin- 
reductase can be an NADPH-dependent thioredoxin-reductase.. The second 
region can encode a thioredoxin and thioredoxin-reductase. In one embodiment, 

25 the thioredoxin and thioredoxin-reductase is obtained from Mycobacterium 

leprae. In another embodiment, the at least one thioredoxin-related protein can 
be an engineered fusion protein. The first region can precede, in a 5' to 3' 
direction, the second region. Alternatively, the first region follows, in a 5' to 3' 
direction, the second region. The gene fusion can optionally further comprise a 

30 third region encoding a second thioredoxin-related protein or an active fragment 
thereof, operably linked to the first region, or to the second region, or to both. A 
seed-specific promoter, such as a phaseolin promoter, can be operably linked to 
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the gene fusion. In one embodiment, at least one thioredoxin-related protein is 
derived from a plant species selected from the group consisting of Arabidopsis 
and wheat. In another embodiment, at least one thioredoxin-related protein can 
be derived from E. coIL 
5 In one embodiment, the gene fusion further comprises a nucleic acid 

sequence encoding an oil-body-surface-avoiding linker amino acid sequence, 
wherein the linker amino acid sequence is positioned between the first region 
and the second region. The oil-body-surface-avoiding linker amino acid sequence 
can be substantially negatively charged, or have a molecular weight of at least 

10 35 kd. In addition, the gene fusion can further comprise a linker nucleic acid 
sequence encoding an amino acid sequence that is specifically cleavable by an 
enzyme or a chemical, wherein the linker sequence is positioned between the oil- 
body-surface-avoiding linker amino acid sequence and the second region. 
Also provided herein are transgenic plants containing a nucleic acid 

15 construct comprising a gene fusion, wherein the gene fusion comprises a region 
encoding an oil-body-protein or an active fragment thereof, operably linked to a 
region encoding a first thioredoxin-related protein or an active fragment thereof. 
The thioredoxin-related protein can be thioredoxin. The nucleic acid construct 
can be contained within a plastid. In one embodiment, when the first 

20 thioredoxin-related protein is thioredoxin and the construct can further comprise 
a region encoding a thioredoxin-reductase. The gene fusion can optionally 
further comprise a third region encoding a second thioredoxin-related protein or 
an active fragment thereof, operably linked to the first region, or to the second 
region, or to both. The gene fusion can optionally further comprise a nucleic 

25 acid sequence encoding an oil-body-surface-avoiding linker amino acid sequence, 
wherein the nucleic acid encoding the linker amino acid sequence is positioned 
between the region encoding an oil-body-protein and the region encoding a first 
thioredoxin-related protein. The oil-body-surface-avoiding linker amino acid 
sequence can be substantially negatively charged, or have a molecular weight of 

30 at least 35 kd. The gene fusion can optionally further comprise a linker nucleic 
acid sequence encoding an amino acid sequence that is specifically cleavable by 
an enzyme or a chemical, wherein the linker sequence is 
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positioned between the oil-body-surface-avoiding linker amino acid sequence and 
the region encoding a first thioredoxin-related protein. 

Also provided is a transgenic plant comprising a nucleic acid construct, a 
seed-specific promoter operably linked to a gene fusion, wherein the gene fusion 
5 comprises a region encoding an oil-body-protein or an active fragment thereof, 
operably linked to a region encoding a first thioredoxin-related protein or an 
active fragment thereof, wherein a fusion protein comprising activities of oleosin 
and the thioredoxin-related protein is produced in a seed of the plant. In another 
embodiment, a thioredoxin-related protein having concentration of at least about 

10 0.5% of total cellular seed protein is provided. Also provided herein is an 

extract comprising an activity of a thioredoxin-related protein. Also provided are 
oil bodies and/or oil obtained from various seeds. 

Also provided herein are methods of making a fusion protein comprising a 
thioredoxin-related activity, the method comprising the steps of: 

15 a) providing a transgenic plant comprising a nucleic acid construct 

comprising a seed-specific promoter operably linked to a gene fusion, wherein 
the gene fusion comprises a region encoding an oil-body-protein or an active 
fragment thereof, operably linked to a region encoding a first thioredoxin-related 
protein or an active fragment thereof, the gene fusion encoding a fusion protein 

20 comprising a thioredoxin-related activity; 

b) obtaining seeds from the plant; and 

c) recovering the fusion protein by isolating oil bodies from the seeds. In 
one embodiment, the oil bodies are fractionated to achieve partial purification of 
the fusion protein. The oil bodies can be in association with a fusion protein. 

25 The oil-body-protein, can be cleaved from the thioredoxin-related protein after 
fractionation of the oil bodies. The cleaving step can make use of a protease or 
chemical proteolysis. 

Also provided herein are methods of reducing allergenicity of a food 
comprising the steps of: 

30 a) providing a preparation comprising oil bodies associated with a fusion 

protein, the fusion protein comprising an oil-body-protein or an active fragment 
thereof and a thioredoxin-related protein or an active fragment thereof; and 
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b) adding the preparation to the food, whereby allergenicity of the food is 
reduced due to activity of the thioredoxin-related protein or fragment. The food 
can be wheat flour, wheat dough, milk, cheese, yogurt and ice cream. In one 
embodiment, NADH is used as a co-factor in the substantial absence of NADPH. 
5 Also provided herein are pharmaceutical compositions comprising a fusion 

protein, the fusion protein comprising an oil-body-protein or an active fragment 
thereof and a thioredoxin-related protein or an active fragment thereof, in a 
pharmaceutically acceptable carrier. The oil bodies can be associated with the 
fusion protein. Also provided is a cosmetic formulation comprising oil bodies 

10 associated with a fusion protein, the fusion protein comprising an oil-body- 
protein or an active fragment thereof and a thioredoxin-related protein or an 
active fragment thereof, in a pharmaceutically acceptable carrier. Also provided 
are methods of treating or protecting a target against oxidative stress, 
comprising the steps of: 

15 a) providing a preparation comprising a fusion protein, the fusion protein 

comprising an oil-body-protein or an active fragment thereof and a thioredoxin- 
related protein or an active fragment thereof; and 

b) contacting the preparation with a target, wherein the target is 
susceptible to oxidative stress, thereby treating or protecting against the stress. 

20 The target can be selected from the group consisting of a molecule, a molecular 
complex, a cell, a tissue, and an organ. 

Also provided is a nucleic acid construct comprising a gene fusion, 
wherein the gene fusion comprises a first region encoding an oil-body-protein or 
an active fragment thereof, operably linked to a second region encoding at least 

25 one polypeptide or an active fragment thereof, and an oil-body-surface-avoiding 
linker in frame between the first and second region polypeptides. Also provided 
are methods of expressing this construct into the encoded amino acid sequence; 
and oil bodies, formulations, emulsions, cells, and plants comprising the 
construct and encoded amino acid sequence. These particular constructs, oil 

30 bodies, formulations, emulsions, cells, and plants can be produced according to 
the methods described herein. The second region can encode any polypeptide, 
for example, a therapeutically, nutritionally, industrially or cosmetically useful 
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peptide as set forth herein. For example, the second region can encode a redox 
protein, an immunoglobulin, a thioredoxin-related protein or any one or more 
recombinant polypeptides of a multimeric-protein-complex. 

Other features and advantages of the present invention will become 
5 readily apparent from the following detailed description. It should be understood 
however that the detailed description and the specific examples while indicating 
particular embodiments of the invention are given by way of illustration only. 

BRIEF DESCRIPTION OF THE DRAWINGS 

10 Figure 1 shows a ClustalW Formatted Alignment comparison of the 

published NADPH thioredoxin-reductase nucleic acid sequence (SEQ ID NO:9) 
(ATTHIREDB-Jacquot et al. J. Mol. Biol. (1994) 235 (4): 1357-63.) with the 
sequence isolated herein in Example 1 (TR; SEQ ID NO:8). 

Figure 2 shows a ClustalW Formatted Alignment comparison of the 

15 deduced amino acid sequence of the published NADPH thioredoxin-reductase 
sequence (SEQ ID NO:1 2KATTHIREDB Jacquot et al. J. Mol. Biol. (1994) 235 
(4):1 357-63.) with the sequence isolated herein in Example 1 (TR; SEQ ID 
NO:13). 

Figure 3 shows a clustal alignment comparing the amino acid sequence of 
20 the Arabidopsis thaliana thioredoxin-reductase-linker-thioredoxin synthetic fusion 
(Arab TR-link-Trxh; SEQ ID NO:37) to the Mycobacterium leprae thioredoxin- 
reductase-thioredoxin natural fusion (M.lep TR/T rxh; SEQ ID NO:36) natural 
fusion. Overall, the proteins are approximately 50% identical at the amino acid 
level. 

25 Figure 4 is a bar graph showing the thioredoxin/thioredoxin-reductase 

activity measurements for the various transgenic Arabidopsis seed fractions. 
Relative specific activity is expressed as a percentage of the E. coli thioredoxin 
and thioredoxin-reductase activities. The numbered bars in the graph correspond 
to the following: 

30 1 . W.T. + oleosin-thioredoxin 

2. W.T. + thioredoxin-oleosin 

3. W.T. + thioredoxin 
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4. W.T. + oleosin-thioredoxin-reductase 

5. W.T, + thioredoxin-reductase-oleosin 

6. W.T. + thioredoxin-reductase 

7. thioredoxin + oleosin-thioredoxin-reductase 
5 8. thioredoxin 4- thioredoxin-reductase-oleosin 

9. thioredoxin 4- thioredoxin-reductase 

10. thioredoxin-reductase + oleosin-thioredoxin 

1 1 . thioredoxin-reductase + thioredoxin-oleosin 

12. oleosin-ZW./ep TR/Trxh 

10 13. Em coli thioredoxin-reductase + thioredoxin 

Figure 5 provides a listing of exemplary proteins for use in the 
heteromultimeric-fusion-proteins and heteromultimeric-protein-complexes 
provided herein. 
DETAILED DESCRIPTION 

15 As hereinbefore mentioned, the present invention relates to novel and 

improved methods for the production of multimeric proteins, including a first and 
second recombinant polypeptide, multimeric-protein-complexes, 
heteromultimeric-protein-complexes, multimeric-fusion-proteins, 
heteromultimeric-fusion-proteins, immunoglobulin-polypeptide-chains, 

20 immunoglobulins, redox-fusion-polypeptides, and a first and second thioredoxin- 

o 

related protein; and related products. These methods permit the production of 
active multimeric-protein-complexes in association with oil bodies. The oil 
bodies in association with the multimeric-protein-complex may be used to 
prepare various useful emulsions. 

25 Accordingly, provided herein are methods of producing a recombinant 

multimeric-protein-complex associated with an oil body, said method comprising: 

(a) producing in a cell comprising oil bodies, a first recombinant 
polypeptide and a second recombinant polypeptide wherein said first 
recombinant polypeptide is capable of associating with said second recombinant 

30 polypeptide in the cell to form said multimeric-protein-complex; and 
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(b) associating said multimeric-protein-complex with an oil body through 
an oil-body-targeting-protein capable of associating with said oil body and said 
first recombinant polypeptide. 
Definitions and terms 
5 Unless defined otherwise, all technical and scientific terms used herein 

have the same meaning as is commonly understood by one of skill in the art to 
which this invention belongs. Where permitted, all patents, applications, 
published applications and other publications and sequences from GenBank, 
SwissPro and other data bases referred to throughout in the disclosure herein are 

10 incorporated by reference in their entirety. 

As used herein, the phrase "multimeric-protein-complex", refers to two or 
more polypeptide chains that permanently or repeatedly interact or permanently 
or repeatedly coordinate to form a biologically active assembly comprising said 
two or more polypeptide chains. It should be noted that the polypeptides may 

15 be independently biologically active without interaction or coordination to form 
the complex. The multimeric-protein-complex may provide a biological structure, 
or it may be capable of facilitating a chemical or biological reaction. For 
example, one of the protein regions within the multimeric-protein-complex can 
repeatedly activate or repeatedly inactivate the biological or metabolic activity of 

20 one or mqre of the other proteins contained within the multimeric-protein- 
complex. In one embodiment, the first and second recombinant polypeptide 
contained in a multimeric-protein-complex may either associate or interact as 
independent non-contiguous polypeptide chains or the multimeric-protein- 
complex may be prepared as a fusion polypeptide (multimeric-fusion-protein) 

25 between the first and second recombinant polypeptide. 

One example of a repeated (e.g., reoccurring) interaction or association 
between the two or more polypeptides of a multimeric-protein-complex provided 
herein is the interaction between two or more non-identical redox proteins to 
form a heteromultimeric-protein-complex. Exemplary redox proteins for use in 

30 this regard are thioredoxin and the thioredoxin-reductase. A further example is 
the interaction between two or more immunoglobulin-polypeptide-chains to form 
an immunoglobulin. As used herein, the phrase "heteromultimeric-protein- 
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complex", refers to two or more non-identical polypeptide chains that 
permanently or repeatedly interact or permanently or repeatedly coordinate to 
form a biologically active assembly comprising said two or more polypeptide 
chains. Other examples of multimeric-protein-complexes provided herein include 
5 a first and second recombinant polypeptide, heteromultimeric-protein-complexes, 
multimeric-fusion-proteins, heteromultimeric-fusion-proteins, immunoglobulins, 
first and second immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, 
and a first and second thioredoxin-related protein. 

The recombinant polypeptide or multimeric-protein-complex is associated 

10 with an oil body. As used herein, the phrase "oil body" or "oil bodies" refers to 
any oil or fat storage organelle in any cell type. Accordingly, the oil bodies may 
be obtained from any cell comprising oil bodies, including plant cells (described 
in for example: Huang (1992) Ann. Rev. Plant Mol. Biol. 43: 177-200), animal 
cells (described in for example: Murphy (1990) Prog Lipid Res 29(4): 299-324), 

15 including adipocytes, hepatocytes, steroidogenic cells, mammary epithelial cells, 
macrophages, algae cells (described in for example: Rossler (1988) J. Physiol. 
London, 24: 394-400) fungal cells, including yeast cells (described in for 
example Leber et al. (1994) Yeast 10: 1421-1428) and bacterial cells (described 
in for example: Pieper-Furst et al. (1994) J. Bacteriol. 176: 4328-4337). 

20 Preferably the oil bodies used herein are oil bodies obtainable from plant cells 
and more preferably the oil bodies obtainable from plant seed cells. 

As used herein, the phrase "is capable of associating with", "associate" 
or grammatical variations thereof, refers to any interaction between two or more 
polypeptides, including any covalent interactions (e.g. muftimeric-fusion-proteins) 

25 as well as non-covalent interactions. Exemplary non-covalent interactions can 
be between the oil-body-targeting-protein and a redox protein or immunoglobulin- 
polypeptide-chain, as well as between two or more different proteins contained 
within two or more separate oil-body-protein fusion proteins (e.g., the redox 
proteins in oleosin-thioredoxin and oleosin-thioredoxin-reductase). 

30 As used herein, the term "recombinant" (also referred to as heterologous) 

in the context of recombinant proteins and amino acids, means "of different 
natural origin" or represents a non-natural state. For example, if a host cell is 
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transformed with a nucleotide sequence derived from another organism, 
particularly from another species, that nucleotide sequence and amino acid 
sequence encoded thereby, is recombinant (heterologous) with respect to that 
host cell and also with respect to descendants of the host cell which carry that 
5 gene. Similarly, recombinant (or heterologous) refers to a nucleotide sequence 
derived from and inserted into the same natural, original cell type, but which is 
present in a non-natural state, e.g., a different copy number, or under the control 
of different regulatory elements. A transforming nucleotide sequence may 
include a recombinant coding sequence, or recombinant regulatory elements. 

1 0 Alternatively, the transforming nucleotide sequence may be completely 

heterologous or may include any possible combination of heterologous and 
endogenous nucleic acid sequences. 

In various embodiments of the present invention, the first and/or second 
recombinant polypeptides, multimeric-protein-complexes, heteromultimeric- 

1 5 protein-complexes, multimeric-fusion-proteins, heteromultimeric-fusion-proteins, 
immunoglobulins, immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, 
and/or thioredoxin-related proteins, are produced in a cell comprising oil bodies. 
As used herein the phrase "in a cell", "in the cell", or grammatical variations 
thereof, mean that the first and/or second recombinant polypeptides, multimeric- 

20 protein-complexes, heteromultimeric-protein-complexes, multimeric-fusion- 
proteins, heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin- 
polypeptide-chains, redox-fusion-polypeptides, and/or thioredoxin-related 
proteins, may be produced in any cellular compartment of that cell, so long as 
that cell comprises oil bodies therein. In embodiments of the invention in which 

25 plant cells are used, the phrase is intended to include the plant apoplast. 

In various embodiments provided herein, the first and/or second 
recombinant polypeptides, multimeric-protein-complexes, heteromultimeric- 
protein-complexes, multimeric-fusion-proteins, heteromultimeric-fusion-proteins, 
immunoglobulins, immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, 

30 and thioredoxin-related proteins, associate with an oil body through an oil-body- 
targeting-protein. As used herein, the phrase "oil-body-targeting-protein" refers 
to any protein, protein fragment or peptide capable of associating with an oil 
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body. Exemplary oil-body-targeting-proteins for use herein include oil-body- 
proteins, such as oleosin and caleosin; immunoglobulins, such as bi-specif ic 
antibodies; and the like. 

In embodiments described herein in which an oil-body-protein is used, the 
5 first and/or second recombinant polypeptides, multimeric-protein-complexes, 
heteromultimeric-protein-complexes, multimeric-fusion-proteins, 
heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypeptide- 
chains, redox-fusion-polypeptides, and thioredoxin-related proteins, are 
preferably fused to the oil-body-protein. The term n oil-body-protein n refers to 

10 any protein naturally present in cells and having the capability of association 
with oil bodies, including any oleosin or caleosin. 

Accordingly, provided herein a method of expressing a recombinant 
multimeric-protein-complex comprising a first and second recombinant 
polypeptide in a cell, said method comprising: 

15 (a) introducing into a cell a first chimeric nucleic acid sequence comprising: 

(i) a first nucleic acid sequence capable of regulating transcription in said cell 
operatively linked to; 

(ii) a second nucleic acid sequence encoding a first recombinant polypeptide, 
such as a redox protein, an immunoglobulin-polypeptide-chain or an thioredoxin- 

20 related protein, fused to an oil-body-protein; 

(b) introducing into said cell a second chimeric nucleic acid sequence comprising: 

(i) a third nucleic acid sequence capable of regulating transcription in said cell 
operatively linked to; 

(ii) a fourth nucleic acid sequence encoding a second recombinant polypeptide, 
25 such as a second redox protein, a second immunoglobulin-polypeptide-chain or a 

second thioredoxin-related protein,; 

(c) growing said cell under conditions to permit expression of said first and 
second recombinant polypeptide in a progeny cell comprising oil bodies wherein 
said first recombinant polypeptide and said second recombinant polypeptide are 

30 capable of forming a multimeric-protein-complex, preferably in said progeny cell; 
and 
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(d) associating said first recombinant polypeptide with an oil body through said 
oil-body-protein. 

The term "nucleic acid" as used herein refers to a sequence of nucleotide 
or nucleoside monomers consisting of naturally occurring bases, sugars and 
5 intersugar (backbone) linkages. The term also includes modified or substituted 
sequences comprising non-naturally occurring monomers or portions thereof, 
which function similarly. The nucleic acid sequences may be ribonucleic acids 
(RNA) or deoxyribonucleic acids (DNA) and may contain naturally occurring 
bases including adenine, guanine, cytosine, thymidine and uracil. The sequences 

10 also may contain modified bases such as xanthine, hypoxanthine, 2- 

aminoadenine, 6-methyl, 2-propyl and other alkyl adenines, 5-halo-uracil, 5-halo 
cytosine, 6-aza uracil, 6-aza cytosine and 6-aza thymine, pseudo uracil, 4- 
thiouracil, 8-halo adenine, 8-amino adenine, 8-thiol-adenine, 8-thio-alkyl 
adenines, 8-hydroxyl adenine and other 8-substituted adenines, 8-halo guanines, 

15 8 amino guanine, 8 thiol guanine, 8-thioalkyl guanines, 8 hydroxyl guanine and 
other 8-substituted guanines, other aza and deaza uracils, thymidines, cytosines, 
adenines, or guanines, 5-trifIuoromethyl uracil and 5-trifluoro cytosine. 
Multimeric-protein-complexes 

In accordance with the methods and compositions provided herein, any 

20 two recombinant polypeptides capable of forming a multimeric-protein-complex 
may be used. The nucleic acid sequences encoding the two recombinant 
polypeptides may be obtained from any biological source or may be prepared 
synthetically. In general nucleic acid sequence encoding multimeric proteins are 
known to the art and readily available. Known nucleic acid sequences encoding 

25 multimeric-protein-complexes may be used to design and construct nucleic acid 
sequence based probes in order to uncover and identify previously undiscovered 
nucleic acid sequences encoding multimeric-protein-complexes, for example, by 
screening cDIMA or genomic libraries or using 2- or multi-hybrid systems. Thus, 
additional nucleic acid sequences encoding multimeric-protein-complexes may be 

30 discovered and used as described herein. 

The first and/or second recombinant polypeptides that are comprised 
within a multimeric-protein-complex provided herein, can themselves be in the 
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form of heteromultimeric-protein-complexes, multimeric-fusion-proteins, 
heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypeptide- 
chains, redox-fusion-polypeptides, and/or a first and/or second thioredoxin- 
related protein. 

5 The nucleic acid sequence encoding the first and second recombinant 

polypeptide, heteromuftimeric-protein-complexes, multimeric-fusion-proteins, 
heteromultimeric-fusion-proteins r immunoglobulins, immunoglobulin-polypeptide- 
chains, redox-fusion-polypeptides, and/or a first and/or second thioredoxin- 
related protein may be obtained from separate sources or may be obtained from 

10 the same source. In general however, such nucleic acid sequence is obtained 
from the same or a similar biological source. In certain embodiments wherein 
the nucleic acid sequence encoding the first and second recombinant polypeptide 
protein are obtained from the same source, the nucleic acid sequence encoding 
the first recombinant polypeptide and second recombinant polypeptide may be 

15 naturally fused. In accordance with a particular embodiment, the nucleic acid 
sequences encoding the first and second recombinant polypeptide are obtained 
from a plant source. 
Oil-Body-Surface-Avoiding Linkers 

Polypeptide spacers or linkers of variable length and/or negative charge 

20 can be used herein to separate the first and/or second recombinant polypeptides, 
multimeric-protein-complexes, heteromultimeric-protein-complexes, multimeric- 
fusion-proteins, heteromultimeric-fusion-proteins, immunoglobulins, 
immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, and the first 
and/or second thioredoxin-related proteins from the in-frame oil-body-targeting- 

25 protein, to improve activity of and/or the accessibility of the polypeptide or 

complex. For example, in one embodiment set forth herein, positioned between 
a nucleic acid sequence encoding a sufficient portion of an oil-body-protein and a 
nucleic acid sequence encoding either the first and/or second recombinant 
polypeptides, multimeric-protein-complexes, heteromultimeric-protein-complexes, 

30 multimeric-fusion-proteins, heteromultimeric-fusion-proteins, immunoglobulins, 
immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, and the first 
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and/or second thioredoxin-related proteins; is a linker nucleic acid sequence 
encoding an oil-body-surface-avoiding linker amino acid sequence. 

Oil-body-surface-avoiding linkers are positioned between the oil-body 
targeting sequence and an in-frame recombinant polypeptide of interest, e.g., the 
5 multimeric-protein-complexes provided herein, serve to increase the distance and 
or decrease the interaction between the negatively charged oil body surface and 
the recombinant polypeptide of interest. A negatively charged linker is repelled 
by the negatively charged oil body surface, in turn increasing the distance or 
decreasing the interaction of its attached recombinant polypeptide with the oil 

10 body surface. As a consequence of the increased distance from the oil body 

surface, the recombinant polypeptide will be more accessible, e.g. to itstarget(s) 
substrate, protein substrate, protein partner, and less affected by the charged oil 
body surface. Exemplary linker sequences for use herein can be either a 
negatively charged linker, or a linker having a molecular weight of at least about 

15 35 kd or more. 

As used herein, a "negatively charged linker" sequence, refers to any 
amino acid segment, or nucleic acid encoding such, that has a pi less than or 
equal to the pi of an oil body. In certain embodiments, the pi of the negatively 
charged linker is about 90%, 80%, 70%, 60%, 50%, 40%, 30%, down to 

20 about 25% or more, below that of the pi of an oil body in the particular plant or 
cell system being used. Exemplary negatively charged linkers can be prepared 
comprising any combination of the negatively charged amino acid residues. For 
example, in one embodiment, a negatively charged linker comprises either a 
poly-glutamate or poly-aspartate sequence, or any combination of both amino 

25 acid residues. The negatively charged linker is typically at least 5, 10, 15, 20, 
25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100 or more amino 
acids in length. The negatively charged linkers are preferably non-proteolytic 
(e.g., non-proteolytic linkers), having no site for efficient proteolysis. When 
linker size rather than charge is used to minimize interaction of the recombinant 

30 polypeptide of interest with the oil body surface, then the linker is non- 
proteolytic and ranges in molecular weight from about 35 kd up to about 100 
kd. The upper size limit is chosen such that the expression of, the activity of, 
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the conformation of, and/or the access to target of, the recombinant polypeptide 
of interest is not significantly affected by the linker. 

In certain embodiments, described herein where a non-proteolytic linker 
amino acid sequence is employed, the gene fusion or protein fusion (multimeric- 
5 fusion-protein) can optionally further comprise a linker nucleic or amino acid 
sequence encoding a sequence that is specifically cleavable by an enzyme or a 
chemical, wherein the linker sequence is positioned between the non-proteolytic 
linker sequence and sequence encoding the desired recombinant protein region, 
e.g., the first and/or second recombinant polypeptides, multimeric-protein- 

10 complexes, heteromultimeric-protein-complexes, multimeric-fusion-proteins, 

heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypeptide- 
chains, redox-fusion-polypeptides, or the first and/or second thioredoxin-related 
proteins set forth herein. When a cleavable linker sequence is used herein, in a 
particular embodiment, it is further downstream than the non-proteolytic linker 

15 sequence from the oil-body-targeting-protein region of the fusion protein. By 
virtue of cleavable linker, the recombinant fusion polypeptides provided herein, 
such as the multimeric-fusion-proteins and redox fusion polypeptides, can be 
isolated and purified by introducing an enzyme or chemical that cleaves said 
multimeric-fusion-protein and/or redox fusion polypeptide from said oil body, 

20 thereby obtaining and/or isolating the desired protein. It is contemplated herein 
that the use of cleavable linker sequence downstream of the non-proteolytic 
linker/spacer sequence will improve the yield of protein recovery when isolating 
or purifying proteins using the methods provided herein. 

The nucleic acid sequences encoding the first or second recombinant 

25 polypeptide may be altered to improve expression levels for example, by 

optimizing the nucleic acids sequence in accordance with the preferred codon 
usage for the particular cell type which is selected for expression of the first and 
second recombinant polypeptide, or by altering of motifs known to destabilize 
mRNAs (see for example: PCT Patent Application 97/02352). Comparison of 

30 the codon usage of the first and second recombinant polypeptide with codon 
usage of the host will enable the identification of codons that may be changed. 
For example, typically plant evolution has tended towards a preference for CG 
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rich nucleotide sequences while bacterial evolution has resulted in bias towards 
AT rich nucleotide sequences. By modifying the nucleic acid sequences to 
incorporate nucleic acid sequences preferred by the host cell, expression may be 
optimized. Construction of synthetic genes by altering codon usage is described 
5 in for example PCT patent Application 93/07278. The first and second 

recombinant polypeptide can be altered using for example targeted mutagenesis, 
random mutagenesis (Shiraishi et al. (1998) Arch. Biochem. Biophys. 358: 104- 
115; Gaikin et al. (1997) Protein Eng. 10: 687-690; Carugo et al. (1997) 
Proteins 28: 10-28; Hurley et al. (1996) Biochemistry 35: 5670-5678), gene 
10 shuffling, and/or by the addition of organic solvent (Holmberg et al. (1999) 
Protein Eng. 12: 851-856). Any polypeptide spacers that are used in 
accordance with the methods and products provided herein may be altered in 
similar ways. 

In particular embodiments provided herein, the recombinant polypeptides 
15 or thioredoxin-related proteins capable of forming a multimeric-protein-complex 
are capable of forming a heteromultimeric-protein-complex. Examples of 
heteromultimeric-protein-complexes that contain polypeptide chains that 
repeatedly interact, either to activate, inactivate, oxidize, reduce, stabilize, etc., 
with one another, that can be produced in association with oil bodies using the 
20 methods provided herein include those set forth in Figure 5. Accordingly, 

exemplary proteins for use in the heteromultimeric-protein-complexes and nucleic 
acid constructs encoding such, provided herein include, among others described 
herein, those set forth in Figure 5. 

Other polypeptide regions that can be used in the first and/or second 
25 recombinant polypeptides, multimeric-protein-complexes, heteromultimeric- 

protein-complexes, multimeric-fusion-proteins, heteromultimeric-fusion-proteins, 
immunoglobulins, immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, 
or the first and/or second thioredoxin-related proteins, provided herein include, 
among other, those immunoglobulin regions set forth in Table 1 . 
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TABLE 1 - ANTIBODY HETERODIMERS 

Class or molecule Subunits 

Fab Variable region and first constant region of 

heavy chain and complete light chain 
5 Fv Variable regions of heavy and light 

antibody chains 

IgA heavy chains, light chains and J (joining) 

chain 

IgG, IgD, IgE heavy and light chains 

IgM heavy chains, light chains and J (joining) 

chain 

Antibody chain(s) and a toxin Antibody chain(s) and a toxin 
1 0 Autoantigens, allergens and Autoantigens, allergens and transplant 

transplant antigens with an antigens with an adjuvant or tolerogen 
adjuvant or tolerogen 

Chimeras using antibody Fc Receptor subunits fused to the constant 
domain region of antibody heavy chains 



15 



20 



As set forth above, in one embodiment, exemplary heteromultimeric- 
protein-complexes and exemplary heteromultimeric-fusion-proteins provided 
herein comprise redox proteins, such as the thioredoxins and thioredoxin- 
reductases and immunoglobulins. 



Oil-body-targeting-proteins 

The nucleic acid sequence encoding the oil-body-targeting-protein that 
may be used in the methods and compositions provided herein may be any 
nucleic acid sequence encoding an oil-body-targeting-protein, protein fragment or 

25 peptide capable of association with first recombinant polypeptide, 
heteromultimeric-protein-complexes, multimeric-fusion-proteins, 
heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypeptide- 
chains, redox-fusion-polypeptides, and/or a first and/or second thioredoxin- 
related protein and the oil bodies. The nucleic acid sequence encoding the oil 

30 body targeting peptide may be synthesized or obtained from any biological 
source. 

For example, in one embodiment the oil-body-targeting-protein is an 
immunoglobulin or an immunoglobulin derived molecule, for example, 9 bispecific 
single chain antibody. The generation of single chain antibodies and bi-specific 
35 single chain antibodies is known to the art (see, e.g., US Patents US 5,763,733, 
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1135,767,260 and US5,26O,203). Nucleic acid sequences encoding single chain 
antibodies functioning as oil-body-targeting-proteins may be prepared from 
hybridoma cell lines expressing monoclonal antibodies raised against an oleosin 
as described by Alting-Mees et al (2000) IBC's Annual International Conference 
5 on Antibody Engineering, Poster #1 . In order to attain specificity for the first 
recombinant polypeptide a nucleic acid sequence encoding a second single chain 
antibody prepared from a monoclonal raised against the first recombinant 
polypeptide may be prepared and linked to the anti-oleosin single chain antibody. 
In this embodiment the oil body associates with the first recombinant 

10 polypeptide through non-covalent interactions of the oil-body-targeting-protein 
with the first recombinant polypeptide and the oil body. Alternatively the first 
recombinant polypeptide may be prepared as a fusion protein with an oil-body- 
targeting-protein. For example, a nucleic acid sequence encoding a single chain 
antibody raised against an oleosin may be fused to a nucleic acid sequence 

15 encoding the first recombinant polypeptide 

Non-immunoglobulin-based oil-body-targeting-proteins capable of 
association with the first recombinant polypeptide may be discovered and 
prepared using for example phage display techniques (Pharmacia Biotech 
Catalogue Number 27-9401-011 Recombinant Phage Antibody System 

20 Expression Kit). 

Oil-body-targeting-proteins may also be chemically modified. For 
example, oleosins may be modified by changing chemical modification of the 
lysine residues using chemical agents such as biotinyl-N-hydroxysuccinimide 
ester resulting in a process referred to as biotinylation. Conveniently this is 

25 accomplished by in vitro biotinylation of the oil bodies. In vivo biotinylation may 
be accomplished using the biotinylation domain peptide from the biotin carboxy 
carrier protein of E coli acetyl-CoA carboxylase (Smith et al. (1998) Nucl. Acids. 
Res. 26: 1414-1420). Avidin or streptavidin may subsequently be used to 
accomplish association of the redox protein with the oil body. 

30 In a particular embodiment the oil-body-targeting-protein is an oil-body- 

protein such as for example an oleosin or a caleosin or a sufficient portion 
derived thereof capable of targeting to an oil body. Nucleic acid sequences 
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encoding oleosins are known to the art. These include for example the 
Arabidopsis oleosin (van Rooijen et al (1991) Plant MoL Bio. 18:1 177-1 179); the 
maize oleosin (Qu and Huang (1990) J. Biol. Chem. Vol. 265 4:2238-2243); 
rapeseed oleosin (Lee and Huang (1991) Plant Physiol. 96:1395-1397); and the 
5 carrot oleosin (Hatzopoulos et al (1990) Plant Cell Vol. 2, 457-467.). Caleosin 
nucleic acid sequences are also known to the art (Naested et al (2000) Plant Mol 
Biol. 44(4):463-476; Chen et al (1999) Plant Cell Physiol. 40(1O):1079-1086). 
Animal cell derived oil body proteins that may be used herein include adopihilin 
(Brasaemle et al, (1997) J. Lipid Res., 38: 2249-2263; Heid et al. (1998) Cell 

10 Tissue Research 294: 309-321), perilipin (Blanchette-Mackie et al. (1995), J. 
Lipid Res. 36: 121 1-1226; Servetnick et al. (1995) J. Biol. Chem. 270: 16970- 
16973), apolipoproteins such as apo A-l, A-ll, A-IV, C-l, C-ll, Clll (Segrest et al. 
. (1990), Proteins 8:103-1 17) and apoB (Chatterton et al. (1995) J. Lipid Res. 36: 
2027-2037; Davis, RA in: Vance DE, Vance J. editors. Lipoprotein structure and 

15 secretion. The Netherlands, Elsevier, 191: 403-426. 

In one embodiment, the first recombinant polypeptide is fused to an oil- 
body-protein. The methodology is further described in US patent 5,650,554, 
which is incorporated herein by reference in its entirety. The first recombinant 
polypeptide may be fused to the N-terminus as well as to the C-terminus of the 

20 oil-body-protein (as described in: Moloney and van Rooijen (1996) INFORM 
7:107-1 13) and fragments of the oil-body-protein such as for example the 
central domain of an oleosin molecule, or modified versions of the oil-body- 
protein may be used. In this embodiment, the second recombinant polypeptide 
is expressed intracellularly and then intracellularlly associates with the first 

25 recombinant polypeptide to form the multimeric-protein-complex in the cell. Oil 
bodies comprising the multimeric-protein-complex are then conveniently isolated 
from the cells. 

In a further embodiment both the first and second recombinant 
polypeptide are separately fused to an oil-body-protein. In this embodiment 
30 nucleic acid sequences encoding the first and second polypeptides may be 
prepared separately and introduced in separate cell lines or they may be 
introduced in the same cell lines. Where the nucleic acid sequences are 
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introduced in the same cell line, these nucleic acid sequence may be prepared 
using two separate expression vectors, or they may be prepared using a single 
vector comprising nucleic acid sequences encoding both the first polypeptide 
fused to an oil body protein and the second polypeptide fused to an oil-body- 
5 protein. Where separate cell lines are used subsequent mating of the offspring 
(e.g.. mating of plants) is used to prepare a generation of cells comprising oil 
bodies which comprise both the first and second recombinant polypeptide fused 
to an oil-body-protein. 

In further alternate embodiment, the first and second recombinant 

10 polypeptide are fused to form a multimeric-fusion-protein comprising the 
multimeric-protein-complex. In such an embodiment, the first and second 
polypeptide is associated with the oil body through an oil-body-targeting-protein 
capable of associating with both the fusion protein and with the oil body. In a 
particular embodiment, the fusion protein comprising the multimeric-protein- 

15 complex is fused to an oil-body-protein, for example, an oleosin or caleosin. 

In embodiments provided herein in which the multimeric-protein-complex 
is an immunoglobulin (e.g., a multimeric-immunoglobulin-complex), a particularly 
preferred oil body targeting protein is an oleosin or caleosin associated with an 
immunoglobulin binding protein, such as for example protein A (US Patent 

20 5,151,350), protein L (US Patent 5,965,390) and protein G (US Patent 
4,954,618), or active fragments of such immunoglobulin binding proteins. 

New oil-body-proteins may be discovered for example by preparing oil 
bodies (described in further detail below) and identifying proteins in these 
preparations using for example SDS gel electrophoresis. Polyclonal antibodies 

25 may be raised against these proteins and used to screen cDNA libraries in order 
to identify nucleic acid sequences encoding oil-body-proteins. The 
methodologies are familiar to the skilled artisan (Huynh et al. (1985) in DNA 
Cloning Vol. 1. a Practical Approach ed. DM Glover, IRL Press, pp 49-78). New 
oil-body-proteins may further be discovered using known nucleic acid sequences 

30 encoding oil-body-proteins (e.g. the Arabidopsis, rapeseed, carrot and corn 

nucleic acid sequences) to probe for example cDNA and genomic libraries for the 
presence of nucleic acid sequences encoding oil-body-proteins. 
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In one embodiment, the first and second polypeptide are a first and 
second redox protein. Accordingly, one embodiment provided herein relates to 
novel and improved methods for the production of redox proteins. It has 
unexpectedly been found that a redox protein when prepared as a fusion protein 
5 with a second redox protein is fully enzymatically active when produced in 
association with an oil body. In contrast, when the redox protein is prepared 
without the second redox protein it has reduced enzymatic activity. In one 
embodiment, the first redox protein is at least 5 times more active when 
produced as a redox fusion polypeptide relative to production as a non-fusion 
10 polypeptide. 

Accordingly, provided herein are methods for producing an oil body 
associated with a heteromultimeric redox protein complex, said method 
comprising: 

(a) producing in a cell comprising oil bodies, a first redox protein and a 
15 second redox protein wherein said first redox protein is capable of interacting 

with said second redox protein, preferably in the cell, to form said 
heteromultimeric redox protein complex; and 

(b) associating said heteromultimeric redox protein complex with an oil 
body through an oil-body-targeting-protein capable of associating with said oil 

20 bodies and said heteromultimeric redox protein complex. 

In a particular embodiment the first and second redox protein are 
prepared as a fusion protein to form a redox fusion polypeptide. Accordingly, 
provided herein are methods for preparing an enzymatically active redox protein 
associated with oil bodies comprising: 
25 a) producing in a cell a redox fusion polypeptide comprising a first redox 

protein linked to a second redox protein; 

b) associating said redox fusion polypeptide with oil bodies through an 
oil-body-targeting-protein capable of associating with said redox fusion 
polypeptide and said oil bodies; and 
30 c) isolating said oil bodies associated with said redox fusion polypeptide. 

The oil bodies in association with the redox protein may be used to prepare a 
variety of useful emulsions. 
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As used herein the phrase "redox proteins" or grammatical variations 
thereof, refers to any protein or active protein fragment capable of participating 
in electron transport. For example, redox proteins are capable of catalyzing the 
transfer of an electron from an electron donor (also frequently referred to as the 
5 reducing agent) to an electron acceptor (also frequently referred to as the 

oxidizing agent). In the process of electron transfer, the reducing agent (electron 
donor) is oxidized and the oxidizing agent (electron acceptor) is reduced. 
Exemplary redox proteins for use herein include iron-sulfur proteins, 
cytochromes, redox active thiol proteins and redox-active flavoproteins. To 
10 carry out their function as conduits for electrons, redox proteins, such as 
thioredoxin and thioredoxin-reductase for example, are known to function by 
interacting or associating with one another in multimeric-protein-complexes (e.g., 
heteromultimeric-protein-complexesh 

The term "redox fusion polypeptide" as used herein refers to any fusion 
15 polypeptide comprising a first redox protein linked to a second redox protein 
(e.g., an in-frame translational fusion). The redox proteins that may be used 
with the methods and compositions provided herein may be any redox protein. 
In one embodiment the first and second redox proteins are a pair of redox 
proteins that would normally occur together from the same source, in nature. In 
20 a particular embodiment, the first redox protein is a thioredoxin and the second 
redox protein is a thioredoxin-reductase. 

The redox fusion polypeptide may be produced in any cell comprising oil 
bodies, including any animal cell, plant cell, algae cell, fungal cell or bacterial 
cell. In certain embodiments the redox fusion polypeptide is produced in a plant 
25 cell and in particular embodiments the redox fusion polypeptide is produced in 
the seed cells of a seed plant. 

In particular embodiments the oil-body-targeting-protein that is used is an 
oil-body-protein. In embodiments of the present invention in which an oil-body- 
protein is used, the first and second redox protein are preferably covalently fused 
30 to the oil-body-protein. Accordingly, provided herein are methods for the 
preparation of a redox protein in association with an oil body comprising: 

a) introducing into a cell a chimeric nucleic acid sequence comprising: 
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1 ) a first nucleic acid sequence capable of regulating transcription in 
said cell operatively linked to; 

2) a second nucleic acid sequence encoding a recombinant fusion 
polypeptide comprising (i) a first nucleic acid sequence encoding a 

5 sufficient portion of an oil-body-protein to provide targeting of said 

recombinant fusion polypeptide to an oil body linked in reading 
frame to (ii) a second nucleic acid sequence encoding a redox 
fusion polypeptide comprising a first redox protein linked to a 
second redox protein operatively linked to; 
10 3) a third nucleic acid sequence capable of terminating transcription 

in said cell; 

b) growing said cell under conditions to permit expression of said 
redox fusion polypeptide in a progeny cell comprising oil bodies; 
and 

15 c) isolating said oil bodies comprising said redox fusion polypeptide 

from said progeny cell. 



Redox Proteins 

In accordance with various methods and compositions provided herein, 
20 any nucleic acid sequence encoding a redox protein may be used. The nucleic 
acid sequence encoding the first and/or second redox protein may be obtained 
from any biological source or may be prepared synthetically. In general, nucleic 
acid sequences encoding redox proteins are well known in the art and readily 
available. See, for example: Cristiano et al. (1993) Genomics 17: (2) 348-354, 
25 Doyama et al. (1998) Plant Sci. 137: 53-62, Hoeoeg et al. (1984) Biosci. Rep. 4 
917-923; as well as the Swiss Protein sequences set forth in Table 5. Known 
nucleic acid sequences encoding redox proteins may be used to design and 
construct nucleic acid sequence based probes in order to uncover and identify 
previously undiscovered nucleic acid sequences encoding redox proteins, for 
30 example by screening cDNA or genomic libraries. Thus, additional nucleic acid 
sequences may be discovered and used in accordance with the present 
invention. 
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The nucleic acid sequence encoding the first and/or second redox protein 
may be obtained from separate sources or may be obtained from the same 
source. In general however, the nucleic acid sequence encoding a redox-fusion 
polypeptide comprises nucleic acid sequences encoding a first and a second 
5 redox protein obtained from the same or a similar biological source. In certain 
embodiments provided herein, wherein the nucleic acid sequence encoding the 
first and second redox protein is obtained from the same source, the nucleic acid 
sequence encoding the first redox protein and second redox protein may be 
naturally fused. In accordance with a particular embodiment, the nucleic acid 

10 sequences encoding the first and second redox protein are preferably obtained 
from a plant source. 

As set forth above, a polypeptide spacer or linker of variable length may 
separate the first and second redox proteins from each other and/or from the oil- 
body-targeting-protein; and additional redox proteins (e.g., one or more) may be 

15 fused to the first and/or second redox protein. 

The nucleic acid sequences encoding the redox proteins may be altered to 
improve expression levels for example by optimizing the nucleic acids sequence 
in accordance with the preferred codon usage for the particular cell type which is 
selected for expression of the redox proteins, or by altering of motifs known to 

20 destabilize m RIM As (see for example: PCT Patent Application 97/02352). 

Comparison of the codon usage of the redox protein with codon usage of the 
host will enable the identification of codons that may be changed. For example, 
typically plant evolution has tended towards a preference for CG rich nucleotide 
sequences while bacterial evolution has resulted in bias towards AT rich 

25 nucleotide sequences. By modifying the nucleic acid sequences to incorporate 
nucleic acid sequences preferred by the host cell, expression may be optimized. 
Construction of synthetic genes by altering codon usage is described in for 
example PCT patent Application 93/07278. The redox proteins may be altered 
using for example, targeted mutagenesis, random mutagenesis (Shiraishi et al. 

30 (1998) Arch. Biochem. Biophys. 358: 104-1 15; Galkin et al. (1997) Protein Eng. 
10: 687-690; Carugo et al. (1997) Proteins 28: 10-28; Hurley et al. (1996) 
Biochemistry 35: 5670-5678) (and/or by the addition of organic solvent 
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(Holmberg et al. (1999) Protein Eng. 12: 851-856). The polypeptide spacer 
between the first and second redox protein may be altered in similar ways. 

The first and second redox protein may be selected by developing a two- 
dimensional matrix and determining which combination of first and second redox 
5 protein is most effective in electron transport using for example, a colorometric 
reduction assay (Johnson et al (1984) J. of Bact. Vol, 158 3:1061-1069, 
Luthman et al (1982) Biochemistry Vol 21 26:6628-2233). Combinations of 
thioredoxin and thioredoxin-reductase may be tested by determining the 
reduction of wheat storage proteins and milk storage protein beta-lactoglobulin in 

10 vitro (Del Val et al. (1999) J. Allerg. Clin. Immunol. 103: 690-697). Using the 
same strategy polypeptide spacers between the first and second redox proteins 
may be evaluated for their efficiency. 

First and second redox proteins that may be used herein include without 
limitation any first redox protein and second redox protein selected from the 

15 group of redox proteins consisting of cytochromes, such as cytochrome a, 
cytochrome b and cytochrome c; porphyrin containing proteins, for example 
hemoglobin; iron-sulfur proteins, such as ferredoxin; flavoproteins such as 
thioredoxin-reductase, NADH dehydrogenase, succinate dehydrogenase, 
dihydrolipoyl dehydrogenase, acyl-CoA dehydrogenase, D-amino acid oxidase, 

20 xanthine oxidase, orotate reductase and aldehyde oxidase; pyridine-Iinked 
dehydrogenases, for example, lactate dehydrogenase, glyceraldehyde-3- 
phosphate dehydrogenase, malate dehydrogenase, and beta-hydroxy-butarate 
dehydrogenase; and redox active thiol containing proteins such as thioredoxin. 
In particular embodiments, the redox proteins provided herein are 

25 thioredoxin and its reductant thioredoxin-reductase (which are jointly also 

referred to herein as "thioredoxin-related" protein(s)). As used herein, the term 
"thioredoxin" refers to relatively small proteins (typically approximately 12 kDa) 
that belong to the family of thioltransferases which catalyze oxido-reductions via 
the formation or hydrolysis of disulfide bonds and are widely, if not universally, 

30 distributed throughout the animal plant and bacterial kingdom. The reduces form 
of thioredoxin is an excellent catalyst for the reduction of even the most 
intractable disulfide bonds. In order to reduce the oxidized thioredoxin, two 
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cellular reductants provide the reduction equivalents: reduced ferredoxin and 
NADPH. These reduction equivalents are supplied to thioredoxin via interaction 
or association with different thioredoxin-reductases including the NADPH 
thioredoxin-reductase and ferredoxin thioredoxin-reductase. The supply of these 
5 reduction equivalents requires the formation of a heteromultimeric-protein- 
.complex comprising thioredoxin and thioredoxin-reductase. Ferredoxin 
thioredoxin-reductase is involved in the reduction of plant thioredoxins 
designated as Trxf and Trxm, both of which are involved in the regulation of 
photosynthetic processes in the chloroplast. The NADPH/thioredoxin active in 

10 plant seeds is designated Trxh (also referred to herein as thioredoxin h-type) and 
is capable of the reduction of a wide range of proteins thereby functioning as an 
important cellular redox buffer. Generally, only one kind of thioredoxin, which 
analogous to the plant Trxh type, is found in bacterial or animal cells. The h- 
type thioredoxins are capable of being reduced by NADPH and NADPH- 

15 thioredoxin reductase. 

Exemplary thioredoxins are further characterized as a protein having a 
core of 5 beta-sheets surrounded by 4 to 6 alpha helixes. Exemplary 
thioredoxins are further characterized by having an active site containing the 
consensus amino acid sequence: 

20 XCYYCZ, 

wherein Y is any amino acid, such as hydrophobic or non-polar amino acids, 
wherein X can be any of the 20 amino acids, preferably a hydrophobic amino 
acid, such as a tryptophan, and 

Z can be any amino acid, preferably polar amino acids. 
25 In certain embodiments, the thioredoxins for use herein comprise an active site 
having the amino acid sequence XCG.PCZ. 

When the cysteines in the active site of thioredoxin or thioredoxin-like proteins 
are oxidized, they form an intramolecular disulfide bond. In the reduced state, 
the same active sites are capable of participating in redox reactions through the 
30 reversible oxidation of its active site dithiol, to a disulfide and catalyzes 
dithioldisulfide exchange reactions. 
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Exemplary thioredoxins are well-known in the art and can be obtained 
from several organisms including Arabidopsis thaliana (Riveira Madrid et al. 
(1995) Proc. Natl. Acad. Sci. 92: 5620-5624), wheat (Gautier et al. (1998) Eur. 
J. Biochem. 252: 314-324); Escherichia coli (Hoeoeg et al (1984) Biosci. Rep. 4: 
5 917-923) and thermophylic microorganisms such as Methanococcus jannaschii 
and Archaeoglobus fulgidus (PCT Patent Application 00/36126). Thioredoxins 
have also been recombinantly expressed in several host systems including 
bacteria (Gautier et al. (1998) Eur J. Biochem. 252: 314-324) and plants (PCT 
Patent Application WO 00/58453) Commercial preparations of E. coli sourced 

10 Thioredoxins are readily available from for example: Sigma Cat No. T 0910 
Thioredoxin (£. coli, recombinant; expressed in E. coli). 

Exemplary nucleic acid sequences encoding thioredoxin polypeptides for 
use herein are readily available from a variety of diverse biological sources 
including £. coli (Hoeoeg et al. (1984) Biosci. Rep.: 4 917-923); Methanococcus 

15 jannaschii and Archaeoglobus fulgidus (PCT Patent Application 00/36126); 
Arabidopsis thaliana (Rivera-Madrid (1995) Proc. Natl. Acad. Sci. 92: 5620- 
5624); wheat (Gautier et al (1998) Eur. J. Biochem. 252(2): 314-324); tobacco 
(Marty et al. (1991) Plant Mol. Biol. 17: 143-148); barley (PCT Patent 
Application 00/58352); rice (Ishiwatari et al. (1995) Planta 195: 456-463); 

20 soybean (Shi et al. (1996) Plant Mol. Biol. 32: 653-662); rapeseed (Bower et al. 
Plant Cell 8: 1641-1650) and calf (Terashima et al. (1999) DNA Seq. 10(3): 
203-205); and the like. In yet other embodiments, exemplary nucleic acids for 
use herein include those encoding the thioredoxin and thioredoxin-like 
polypeptide chains set forth as SEQ ID NOs:38, 42, 46 and 50; and those 

25 encoding the thioredoxin and thioredoxin-like polypeptide chains set forth in 
Table 5 as SEQ ID NOs:52-194. The respective nucleic acid sequences 
encoding the amino acids set forth in SEQ ID NOs:52-194 can be readily 
identified via the Swiss Protein identifier (accession) numbers provided in Table 5 
(in parenthesis). 

30 As us d herein, the term "thioredoxin-reductase" refers to a protein that 

complexes with a flavin, such as FAD. The flavin compound serves as an 
electron donor for the thioredoxin-reductase protein active site. Thioredoxin 
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reductases have a redox active, disulfide bond site capable of reducing 
thioredoxin. The active site of thioredoxin-reductase contains 2 cysteines. The 
type of amino acids surrounding the 2 cysteine residues forming the active site 
can vary as hydrophobic, non-polar or polar. An exemplary thioredoxin- 
5 reductase is NADPH-thioredoxin-reductase (NTR), which is a cytosolic 
homodimeric enzyme comprising typically 300-500 amino acids, Crystal 
structures of both £ coli and plant thioredoxin-reductase have been obtained 
(Waksman et al. (1994) J. Mol. Biol. 236: 800-816; Dai et al. (1996) J. Mol. 
Biol. 264:1044-1057). NADPH-thioredoxin-reductases have been expressed in 
10 heterologous hosts, for example the Arabidopsis NADPH-thioredoxin-reductase 
has been expressed in £ coli (Jacquot et al. (1994) J. Mol. Biol. 235: 1357- 
1363) and wheat (PCT Patent Application 00/58453). 

Exemplary nucleic acid sequences encoding thioredoxin-reductase 
proteins can readily be obtained from a variety of sources, such as from the 
15 sequence set forth in Table 5 and the Sequence Listing provide herein, from 
Arabidopsis (Riveira Madrid et al. (1995) Proc. Natl. Acad. Sci. USA 92: 5620- 
5624), £ coli (Russel et al. (1988) J. Biol. Chem. 263: 9015-9019); barley (PCT 
Patent Application 00/58352 and wheat (Gautier et ah, (1998) Eur. J. Biochem. 
252: 314-324); and the like. In yet other embodiments, exemplary nucleic acids 
20 for use herein include those encoding the thioredoxin-reductase polypeptide 
chains set forth as SEQ ID NOs:8, 9, 10, 40, 44, 48 and 50; and those 
encoding the thioredoxin-reductase polypeptide chains set forth in Table 5 as 
SEQ ID NOs: 195-3 13. The respective nucleic acid sequences encoding the 
amino acids set forth in SEQ ID N0s:195-313 can be readily identified via the 
25 Swiss Protein identifier (accession) numbers provided in Table 5 (in parenthesis). 
Also contemplated for use in the methods and compositions provided 
herein are nucleic acid and amino acid homologs that are "substantially 
homologous" to the thioredoxin and thioredoxin-reductase nucleic and amino 
acids set forth herein, which includes thioredoxin and thioredoxin-reductase 
30 polypeptides encoded by a sequence of nucleotides that hybridizes urider 

conditions of low, moderate or high stringency to the sequence of nucleotides 
encoding the thioredoxin and thioredoxin-reductase nucleic and amino acids set 
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forth herein (e.g., in the Examples, Sequence Listing and/or Table 5). As used 
herein, a DNA or nucleic acid homolog refers to a nucleic acid that includes a 
preselected conserved nucleotide sequence, such as a sequence encoding a 
therapeutic polypeptide. By the term "substantially homologous" is meant 
5 having at least 80%, preferably at least 90%, most preferably at least 95% 
homology therewith or a less percentage of homology or identity and conserved 
biological activity or function. 

The terms "homology" and "identity" are often used interchangeably. In 
this regard, percent homology or identity may be determined, for example, by 

10 comparing sequence information using a GAP computer program. The GAP 
program utilizes the alignment method of Needleman and Wunsch {J. Mo/. Biol. 
48:443 (1970), as revised by Smith and Waterman (Adv. Appl. Math. 2:482 
(1981). Briefly, the GAP program defines similarity as the number of aligned 
symbols (i.e., nucleotides or amino acids) which are similar, divided by the total 

15 number of symbols in the shorter of the two sequences. The preferred default 
parameters for the GAP program may include: (1) a unary comparison matrix 
(containing a value of 1 for identities and 0 for non-identities) and the weighted 
comparison matrix of Gribskov and Burgess, Nucl. Acids Res. 14:6745 (1986), 
as described by Schwartz and Dayhoff, eds., ATLAS OF PROTEIN SEQUENCE 

20 AND STRUCTURE, National Biomedical Research Foundation, pp. 353-358 

(1 979); (2) a penalty of 3.0 for each gap and an additional 0.10 penalty for each 
symbol in each gap; and (3) no penalty for end gaps. 

By sequence identity, the number of conserved amino acids are 
determined by standard alignment algorithms programs, and are used with 

25 default gap penalties established by each supplier. Substantially homologous 
nucleic acid molecules would hybridize typically at moderate stringency or at 
high stringency all along the length of the nucleic acid of interest. Preferably the 
two molecules will hybridize under conditions of high stringency. Also 
contemplated are nucleic acid molecules that contain degenerate codons in place 

30 of codons in the hybridizing nucleic acid molecule. 

Whether any two nucleic acid molecules have nucleotide sequences that 
are at least 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% "identical" can be 
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determined using known computer algorithms such as the "FAST A" program, 
using for example, the default parameters as in Pearson and Lipman, Proc. Natl. 
Acad. Sci. USA 35:2444 (1988). Alternatively the BLAST function of the 
National Center for Biotechnology Information database may be used to 
5 determine relative sequence identity. 

In general, sequences are aligned so that the highest order match is 
obtained. "Identity" per se has an art-recognized meaning and can be calculated 
using published techniques. (See, e.g.: Computational Molecular Biology, Lesk, 
A.M., ed., Oxford University Press, New York, 1988; Biocomputing: Informatics 

10 and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993; 
Computer Analysis of Sequence Data, Parti, Griffin, A.M., and Griffin, H.G., 
eds., Humana Press, New Jersey, 1994; Sequence Analysis in Molecular 
Biology, von Heinje, G., Academic Press, 1987; and Sequence Analysis Primer, 
Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991). 

15 While there exist a number of methods to measure identity between two 

polynucleotide or polypeptide sequences, the term "identity" is well known to 
skilled artisans (Carillo, H. & Lipton, D„ S/AM J Applied Math 45:1073 (1988)). 
Methods commonly employed to determine identity or similarity between two 
sequences include, but are not limited to, those disclosed in Guide to Huge 

20 Computers, Martin J. Bishop, ed., Academic Press, San Diego, 1994, and 
Carillo, H. & Lipton, D., S/AM J Applied Math 43:1073 (1988). Methods to 
determine identity and similarity are codified in computer programs. Preferred 
computer program methods to determine identity and similarity between two 
sequences include, but are not limited to, GCG program package (Devereux, J., 

25 et aL, Nucleic Acids Research 72W:387 (1 984)), BLASTP, BLASTN, FASTA 
(Atschul, S.F., et aL, J Molec Biol 2/5:403 (1990)). 

Therefore, as used herein, the term "identity" represents a comparison 
between a test and a reference polypeptide or polynucleotide. For example, a 
test polypeptide may be defined as any polypeptide that is 90% or more 

30 identical to a reference polypeptide. 

As used herein, the term at least "90% identical to" refers to percent 
identities from 90 to 99.99 relative to the reference polypeptides. Identity at a 
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level of 90% or more is indicative of the fact that, assuming for exemplification 
purposes a test and reference polynucleotide length of 100 amino acids are 
compared. No more than 10% (i.e., 10 out of 100) amino acids in the test 
polypeptide differs from that of the reference polypeptides. Similar comparisons 
5 may be made between a test and reference polynucleotides. Such differences 
may be represented as point mutations randomly distributed over the entire 
length of an amino acid sequence or they may be clustered in one or more 
locations of varying length up to the maximum allowable, e.g. 10/100 amino 
acid difference (approximately 90% identity). Differences are defined as nucleic 
10 acid or amino acid substitutions, or deletions. 

As used herein: stringency of hybridization in determining percentage 
mismatch is as follows: 

1) high stringency: 0.1 x SSPE, 0.1 % SDS, 65°C 

2) medium stringency: 0.2 x SSPE, 0.1% SDS, 50°C 
1 5 3) low stringency: 1 ,0 x SSPE, 0.1 % SDS, 50°C 

Those of skill in this art know that the washing step selects for stable 
hybrids and also know the ingredients of SSPE (see, e.g. , Sambrook, E.F. 
Fritsch, T. Maniatis, in: Molecular Cloning, A Laboratory Manual , Cold Spring 
Harbor Laboratory Press (1989), vol. 3, p. B.13, see, also, numerous catalogs 

20 that describe commonly used laboratory solutions). SSPE is pH 7.4 phosphate- 
buffered 0.18 NaCI. Further, those of skill in the art recognize that the stability 
of hybrids is determined by T m , which is a function of the sodium ion 
concentration and temperature (T m = 81 .5° C-1 6.6(Iog 10 [Na + ]) + 0.41 (%G + C)- 
600/I)), so that the only parameters in the wash conditions critical to hybrid 

25 stability are sodium ion concentration in the SSPE (or SSC) and temperature. 

It is understood that equivalent stringencies may be achieved using 
alternative buffers, salts and temperatures. By way of example and not 
limitation, procedures using conditions of low stringency are as follows (see also 
Shilo and Weinberg, Proc. Natl. Acad. Sci. USA, 78:6789-6792 (1981)): Filters 

30 containing DNA are pretreated for 6 hours at 40°C in a solution containing 35% 
formamide, 5X SSC, 50 mM Tris-HCI (pH 7.5), 5 mM EDTA, 0.1 % PVP, 0.1 % 
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Ficoll, 1% BSA, and 500 //g/ml denatured salmon sperm DNA (10X SSC is 1.5 
M sodium chloride, and 0.15 M sodium citrate, adjusted to a pH of 7). 

In a particular embodiment, a heteromultimeric-protein-complex is 
produced as a fusion polypeptide between the first and second redox protein, 
5 wherein the first redox protein is thioredoxin and the second redox protein is a 
thioredoxin-reductase. In one embodiment, the second recombinant polypeptide, 
e.g., the thioredoxin-reductase is positioned N-terminal relative to the first 
recombinant polypeptide, e.g., the thioredoxin. Accordingly, any protein which 
is classified as thioredoxin, such as the thioredoxin component of the NADPH 

10 thioredoxin system and the thioredoxin present in the ferredoxin/thioredoxin 
system also known as TRx and TRm may be used in combination with any 
thioredoxin-reductase such as the NADPH thioredoxin-reductase and the 
ferredoxin-thioredoxin-reductase and any other proteins having the capability of 
reducing thioredoxin. In particular embodiments the thioredoxin and thioredoxin- 

15 reductase are plant derived. 

In an alternate embodiment, the naturally occurring nucleic acid sequence 
encoding the thioredoxin/thioredoxin-reductase protein fusion obtainable from 
Mycobacterium leprae (Wieles et al. (1995) J. Biol. Chem. 27:25604-25606) is 
used, as set forth in the Examples herein. 

20 Immunoglobulins 

In another embodiment of the present invention, the multimeric- protein- 
complexes are immunoglobulins. As used herein "immunoglobulin-polypeptide- 
chain" refers to a first polypeptide that is capable of associating with a second 
polypeptide to form an immunologically active (i.e. capable of antigen brnding) 

25 multimeric-protein-complex. The types of immunoglobulins and immunoglobulin- 
polypeptide-chains contemplated for use herein include the immunologically 
active (i.e. antigen binding) portions of a light and heavy chain. Exemplary 
immunoglobulins and immunoglobulin-polypeptide-chains for use herein include 
substantially intact immunoglobulins, including any IgG, IgA, IgD, IgE and IgM, 

30 as well as any portion of an immunoglobulin, including those portions yvell- 
known as Fab fragments, Fab' fragments, F{ab').sub2. fragments and Fv 
fragments. 
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ln this embodiment, the first recombinant polypeptide may be any 
immunoglobulin heavy chain, including any IgG, IgA, IgD, IgE or IgM heavy 
chain, and the second recombinant polypeptide may be a kappa or lambda 
immunoglobulin light chain. Accordingly, provided herein are methods of 
5 producing an immunoglobulin, said method comprising: (a) producing in a cell 
comprising oil bodies, a first immunoglobulin-polypeptide-chain and a second 
immunoglobulin-polypeptide-chain wherein said first irnmunoglobulin-polypeptide- 
chain is capable of associating with said second immunoglobulin-polypeptide- 
chain to form said immunoglobulin; and (b) associating said immunoglobulin with 

10 an oil body through an oil-body-targeting-protein capable of associating with said 
oil bodies and said first immunoglobulin-polypeptide-chain. 

As set forth herein, the multimeric immunoglobulin is associated with an 
oil body through an oil-body-targeting-protein. In particular embodiments, the 
oil-body-targeting-protein may be a fusion polypeptide comprising an 

1 5 oil-bpdy-protein and an immunoglobulin binding protein, such as for example 
protein A, protein L, and protein G. 

In yet another embodiment involving immunoglobulins, the first and 
second recombinant polypeptides (immunoglobulins) are separately fused to an 
oil body protein, for example an oleosin or caleosin. For example, 

20 a) the first recombinant polypeptide may be an immunoglobulin heavy 

chain, including any IgG, IgA, IgD, IgE or IgM heavy chain, and the second 
recombinant polypeptide may be a kappa or lambda immunoglobulin light chain; 
or 

b) the first recombinant polypeptide may be the variable and first 
25 constant domain from an immunoglobulin heavy chain and the second 

recombinant polypeptide may be a kappa or lambda immunoglobulin light chain; 
or 

c) the first recombinant polypeptide may be the variable domain from 
an immunoglobulin heavy chain and the second recombinant polypeptide may be 

30 the variable domain from a kappa or lambda immunoglobulin light chain. 

In certain embodiments, the fusion polypeptides are designed or selected 
to allow the heteromultimeric-protein-complex formation between 
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immunoglobulin light and heavy chain sequences on the oil bodies within the cell 
comprising oil bodies. 

Preparation of expression vectors comprising oil-body-targeting-proteins and the 
first and/or second recombinant polypeptides, multimeric-protein-complexes, 
5 heteromultimeric-protein-complexes, multimeric-fusion-proteins, 

heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypeptide- 
chains, redox-f usion-poiypeptides, or the first and/or second thioredoxin-related 
proteins. 

In accordance with the present invention, the first and/or second 

10 recombinant polypeptides, multimeric-protein-complexes, heteromultimeric- 

protein-complexes, multimeric-fusion-proteins, heteromultimeric-fusion-proteins, 
immunoglobulins, immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, 
or the first and/or second thioredoxin-related proteins; and the oil-body-targeting- 
protein are conveniently produced in a cell. In order to produce the recombinant 

1 5 polypeptides or multimeric-protein-complexes, a nucleic acid sequence encoding 
either the first and/or second recombinant polypeptides, multimeric-protein- 
complexes, heteromultimeric-protein-complexes, multimeric-fusion-proteins, 
heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypeptide- 
chains, redox-fusion-polypeptides, or the first and/or second thioredoxin-related 

20 proteins; and/or the oil-body-targeting-protein are incorporated in a recombinant 
expression vector. Accordingly, provided herein are recombinant expression 
vectors comprising the chimeric nucleic acids provided herein suitable for 
expression of the oil-body-targeting-protein and the first and/or second 
recombinant polypeptides, multimeric-protein-complexes, heteromultimeric- 

25 protein-complexes, multimeric-fusion-proteins, heteromultimeric-fusion-proteins, 
immunoglobulins, immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, 
or the first and/or second thioredoxin-related proteins, suitable for the selected 
cell. The term "suitable for expression in the selected cell" means that the 
recombinant expression vector contains all nucleic acid sequences required to 

30 ensure expression in the selected cell. 

Accordingly, the recombinant expression vectors further contain 
regulatory nucleic acid sequences selected on the basis of the cell which is used 
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for expression and ensuring initiation and termination of transcription operatively 
linked to the nucleic acid sequence encoding the recombinant polypeptide or 
multimeric-protein-complex and/or the oil-body-targeting-protein. Regulatory 
nucleic acid sequences include promoters, enhancers, silencing elements, 
5 ribosome binding sites, Shine-Dalgarno sequences, introns and other expression 
elements. "Operatively linked" is intended to mean that the nucleic acid 
sequences comprising the regulatory regions linked to the nucleic acid sequences 
encoding the recombinant polypeptide or multimeric-protein-complex and/or the 
oil-body-targeting-protein allow expression in the cell. A typical nucleic acid 

10 construct comprises in the 5' to 3' direction a promoter region capable of 
directing expression, a coding region comprising the first and/or second 
recombinant polypeptides, multimeric-protein-complexes, heteromultimeric- 
protein-complexes, multimeric-fusion-proteins, heteromultimeric-fusion-proteins, 
immunoglobulins, immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, 

15 or the first and/or second thioredoxin-related proteins; and/or an oil-body- 
targeting-protein and a termination region functional in the selected cell. 

The selection of regulatory sequences will depend on the organism and 
the cell type in which the first and/or second recombinant polypeptides, 
multimeric-protein-complexes, heteromultimeric-protein-complexes, multimeric- 

20 fusion-proteins, heteromultimeric-fusion-proteins, immunoglobulins, 

immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, or the first and/or 
second thioredoxin-related proteins; and/or the oil-body-targeting-protein is 
expressed, and may influence the expression levels of the polypeptide. 
Regulatory sequences are art-recognized and selected to direct expression of the 

25 oil-body-targeting-protein and the recombinant polypeptides or multimeric- 
protein-complexes in the cell. 

Promoters that may be used in bacterial cells include the lac promoter 
(Blackman et al. (1978) Cell: 13: 65-71), the trp promoter (Masuda et al. (1996) 
Protein Eng: 9: 101-106) and the T7 promoters (Studier et al. (1986) J. Mol. 

30 Biol. 189: 1 13-130). Promoters functional in plant cells that may be used herein 
include constitutive promoters such as the 35S CaMV promoter (Rothstein et al. 
(1987) Gene: 53: 153-161) the actin promoter (McElroy et al. (1990) Plant Cell 
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2: 163-171) and the ubiquitin promoter (European Patent Application 0 342 
926). Other promoters are specific to certain tissues or organs (for example, 
roots, leaves, flowers or seeds) or cell types (for example, leaf epidermal cells, 
mesophyll cells or root cortex cells) and or to certain stages of plant 
5 development. Timing of expression may be controlled by selecting an inducible 
promoter, for example the PR-a promoter described in US Patent 5,614,395. 
Selection of the promoter therefore depends on the desired location and timing 
of the accumulation of the desired polypeptide. In a particular embodiment, the 
first and/or second recombinant polypeptides, multimeric-protein-complexes, 

1 0 heteromultimeric-protein-complexes, multimeric-f usion-proteins, 

heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypeptide- 
chains, redox-fusion-polypeptides, or the first and/or second thioredoxin-related 
proteins; and the oil-body-targeting-protein are expressed in a seed cell and seed 
specific promoters are utilized. Seed specific promoters that may be used herein 

15 include for example the phaseolin promoter (Sengupta-Gopalan et al. (1985) 
Proc. Natl. Acad. Sci. USA: 82: 3320-3324), and the Arabidopsis 18 kDa 
oleosin promoter (van Rooijen et al. (1992) Plant. Mol. Biol. 18: 1177-1179). 
New promoters useful in various plant cell types are constantly discovered. 
Numerous examples of plant promoters may be found in Ohamuro et al. 

20 (Biochem of PI. (1989) 15: 1-82). 

Genetic elements capable of enhancing expression of the polypeptide may 
be included in the expression vectors. In plant cells these include for example, 
the untranslated leader sequences from viruses such as the AMV leader 
sequence (Jobling and Gehrke (1987) Nature: 325: 622-625) and the intron 

25 associated with the maize ubiquitin promoter (See: US Patent 5,504,200). 

Transcriptional terminators are generally art recognized and besides 
serving as a signal for transcription termination serve as a protective element 
serving to extend the mRNA half-life (Guarneros et al. (1982) Proc. Natl. Acad. 
Sci. USA: 79: 238-242). In nucleic acid sequences for the expression in plant 

30 cells, the transcriptional terminator typically is from about 200 nucleotide to 
about 1000 nucleotides in length. Terminator sequences that may be used 
herein include for example, the nopaline synthase termination region (Bevan et 
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al. (1983) Nucl. Acid. Res.: 1 1: 369-385), the phaseolin terminator (van der 
Geest et al. (1 994) Plant J.: 6: 413-423), the terminator for the octopine 
synthase gene of Agrobacterium tumefaciens or other similarly functioning 
elements. Transcriptional terminators can be obtained as described by An 
5 (1987) Methods in Enzym. 153: 292). The selection of the transcriptional 
terminator may have an effect on the rate of transcription. 

Accordingly, provided herein are chimeric nucleic acid sequences 
encoding a first and/or second recombinant polypeptides, multimeric-protein- 
complexes, heteromultimeric-protein-complexes, multimeric-fusion-proteins, 
1 0 heteromultimeric-f usion-proteins, immunoglobulins, immunoglobulin-polypeptide- 
chains, redox-fusion-polypeptides, and/or thioredoxin-related proteins. In one 
embodiment, said nucleic acid comprises: 

(a) a first nucleic acid sequence encoding an oil-body-targeting-protein 
operatively linked in reading frame to; 
15 (b) a second nucleic acid sequence encoding a first recombinant 

polypeptide, immunoglobulin-polypeptide-chain, or redox protein; linked in 
reading frame to; 

(c) a third nucleic acid sequence encoding a second recombinant 
polypeptide, immunoglobulin-polypeptide-chain or redox protein, wherein said 
20 first and second recombinant polypeptides, immunoglobulin-polypeptide-chains 
or redox proteins are capable of forming a multimeric-protein-complex. 

In another embodiment, provided herein is an expression vector 
comprising: 

1 ) a first nucleic acid sequence capable of regulating transcription in said 
25 cell operatively linked to; 

2) a second nucleic acid sequence encoding a recombinant fusion 
polypeptide comprising (i) a nucleic acid sequence encoding a sufficient portion 
of an oil-body-protein to provide targeting of said recombinant fusion polypeptide 
to an oil body linked in reading frame to (ii) a nucleic acid sequence, encoding a 

30 multimeric-fusion-protein, such as a redox fusion polypeptide or immunoglobulin, 
comprising a first recombinant polypeptide, such as a redox protein or 
immunoglobulin-polypeptide-chain, linked to a second recombinant polypeptide, 
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such as a second redox protein or a second immunoglobulin-polypeptide-chain, 
operatively linked to; 

3) a third nucleic acid sequence capable of terminating transcription in 
said cell. 

5 The recombinant expression vector further may contain a marker gene. 

Marker genes that may be used in accordance with the present invention include 
all genes that allow the distinction of transformed cells from non-transformed 
cells including all selectable and screenable marker genes. A marker may be a 
resistance marker such as an antibiotic resistance marker against for example 

10 kanamycin, ampicillin, G418, bleomycin hygromycin, chloramphenicol which 
allows selection of a trait by chemical means or a tolerance marker against for 
example a chemical agent such as the normally phytotoxic sugar mannose 
(Negrotto et al. (2000) Plant Cell Rep. 19: 798-803). In plant recombinant 
expression vectors herbicide resistance markers may conveniently be used for 

15 example markers conferring resistance against glyphosate (US Patents 

4,940,935 and 5,188,642) or phosphinothricin (White et al. (1990) Nucl. Acids 
Res. 18: 1062; Spencer et al. (1990) Theor. Appl. Genet. 79: 625-631). 
Resistance markers to a herbicide when linked in close proximity to the redox 
protein or oil-body-targeting-protein may be used to maintain selection pressure 

20 on a population of plant cells or plants for those plants that have not lost the 
protein of interest. Screenable markers that may be employed to identify 
transformants through visual observation include beta-glucuronidase (GUS) (see 
US Patents US5,268,463 and US5,599,670) and green fluorescent protein (GFP) 
(Niedz et al. (1995) Plant Cell Rep.: 14: 403). 

25 The recombinant expression vectors further may contain nucleic acid 

sequences encoding targeting signals ensuring targeting to a cell compartment or 
organelle. Suitable targeting signals that may be used herein include those that 
are capable of targeting polypeptides to the endomembrane system. Exemplary 
targeting signals that may be used herein include targeting signals capable of 

30 directing the protein to the periplasm, the cytoplasm, the golgi apparatus, the 
apoplast (Sijmons et al., 1990, Bio/Technology, 8:217-221) the chloroplast 
(Comai et al. (1988) J. Biol. Chem. 263: 15104-15109), the mitochondrion, the 
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peroxisome (linger et al. (1989) Plant Mol. Biol. 13: 41 1-418), the ER, the 
vacuole (Shinshi et al. (1990) Plant Mol. Biol. 14: 357-368 and the oil body. By 
the inclusion of the appropriate targeting sequences it is possible to direct the 
oil-body-targeting-protein or the first and/or second recombinant polypeptides, 
5 multimeric-protein-complexes, heteromultimeric-protein-complexes, multimeric- 
fusion-proteins, heteromultimeric-fusion-proteins, immunoglobulins, 
immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, and/or 
thioredoxin-related proteins, to the desired organelle or cell compartment. 

The recombinant expression vectors of the present invention may be 

10 prepared in accordance with methodologies well known to those of skill in the 
art of molecular biology (see for example: Sambrook et al. (1990) Molecular 
Cloning, 2 nd ed. Cold Spring Harbor Press). The preparation of these constructs 
may involve techniques such as restriction digestion, ligation, gel 
electrophoresis, DNA sequencing and PCR. A wide variety of cloning vectors is 

15 available to perform the necessary cloning steps resulting in a recombinant 

expression vector ensuring expression of the polypeptide. Especially suitable for 
this purpose are vectors with a replication system that is functional in 
Escherichia co//such as pBR322, the PUC series of vectors, the M13mp series 
of vectors, pBIuescript etc. Typically these vectors contain a marker allowing 

20 the selection of transformed cells for example by conferring antibiotic resistance. 
Nucleic acid sequences may be introduced in these vectors and the vectors may 
be introduced in E coli grown in an appropriate medium. Vectors may be 
recovered from cells upon harvesting and lysing the cells. 

Recombinant expression vectors suitable for the introduction of nucleic 

25 acid sequences in plant cells include Agrobacterium and Rhizobium based 

vectors such as the Ti and Ri plasmids. Agrobacterium based vectors typically 
carry at least one T-DNA border sequence and include vectors such pBIN 19 
(Bevan (1984) Nucl Acids Res. Vol. 12, 22:871 1-8721) and other binary vector 
systems (for example: US Patent 4,940,838). 

30 Production of c lis comprising a first and/or second recombinant polyp ptides, 
multimeric-protein-complexes, heteromultimeric-protein-complexes, multimeric- 
fusion-proteins, heteromultimeric-fusion-proteins, immunoglobulins. 
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immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, and/or a first 
and/or second thioredoxin-related protein and oil-body-targeting-proteins 

In accordance with the present invention, the recombinant expression 
vectors are introduced into the cell that is selected and the selected cells are 
5 grown to produce the first and/or second recombinant polypeptides, multimeric- 
protein-complexes, heteromultimeric-protein-complexes, multimeric-fusion- 
proteins, heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin- 
polypeptide-chains, redox-fusion-polypeptides, a first and/br second thioredoxin- 
related protein; and the oil-body-targeting-protein either directly or in a progeny 
10 cell. 

Methodologies to introduce recombinant expression vectors into a cell 
also referred to herein as "transformation" are well known to the art and vary 
depending on the cell type that is selected. General techniques to transfer the 
recombinant expression vectors into the cell include electroporation; chemically 

15 mediated techniques, for example CaCI2 mediated nucleic acid uptake; particle 
bombardment (biolistics); the use of naturally infective nucleic acid sequences 
for example virally derived nucleic acid sequences or when plant cells are used 
Agrobacterium or Rhizobium derived nucleic acid sequences; PEG mediated 
nucleic acid uptake, microinjection, and the use of silicone carbide (Whiskers 

20 (Kaeppler et al. (1990) Plant Cell Rep. 9:415-418) all of which may be used 
herein. 

Introduction of the recombinant expression vector into the cell may result 
in integration of its whole or partial uptake into host cell genome including the 
chromosomal DNA or the plastid genome. Alternatively the recombinant 

25 expression vector may not be integrated into the genome and replicate 

independently of the host cell's genomic DNA. Genomic integration of the 
nucleic acid sequence is typically used as it will allow for stable inheritance of 
the introduced nucleic acid sequences by subsequent generations of cells and 
the creation of cell, plant or animal lines. 

30 Particular embodiments involve the use of plant cells. Particular plant 

cells used herein include cells obtainable from Brazil nut (Betholletia exce/sa); 
castor (Riccinus communis)', coconut (Cocus nucifera); coriander (Cor/andrum 
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sativum); cotton (Gossypium spp.); groundnut (Arachis hypogaea); jojoba 
(Simmondsia chinensis); linseed/flax (Linum usitatissimum); maize (Zea mays); 
mustard (Brassica spp. and Sinapis alba); oil palm (Elaeis guineeis); olive (Olea 
europaea); rapeseed (Brassica spp.); safflower (Carthamus tinctorius); soybean 
5 (Glycine max); squash (Cucurbita maxima); barley (Hordeum vulgare); wheat 
(Traeticum aestivum) and sunflower (Helianthus annuus). 

Transformation methodologies for dicotelydenous plant species are well 
known. Generally Agrobacterium mediated transformation is utilized because of 
its high efficiency as well as the general susceptibility by many, if not all 

10 dicotelydenous plant species. Agrobacterium transformation generally involves 
the transfer of a binary vector (e.g. pBIN19) comprising the DNA of interest to 
an appropriate Agrobacterium strain (e.g. CIB542) by for example tri-parental 
mating with an £ coli strain carrying the recombinant binary vector and an £ 
coli strain carrying a helper plasmid capable of mobilization of the binary vector 

15 to the target Agrobacterium strain, or by DNA transformation of the 

Agrobacterium strain (Hofgen et al. Nucl. Acids. Res. (1988) 16: 9877. Other 
transformation methodologies that may be used to transform dicotelydenous 
plant species include biolistics (Sanford (1988) Trends in Biotechn. 6: 299-302); 
electroporation (Fromm et al. (1985) Proc. Natl. Acad. Sci. USA 82: 5824- 

20 5828); PEG mediated DNA uptake (Potrykus et al. (1985) Mol. Gen. Genetics 
199: 169-177); microinjection (Reich et al. Bio/Techn. (1986) 4: 1001-1004) 
and silicone carbide whiskers (Kaeppler et al. (1990) Plant Cell Rep. 9: 415- 
418). The exact transformation methodologies typically vary somewhat 
depending on the plant species that is used. 

25 In a particular embodiment the oil bodies are obtained from safflower and 

the recombinant proteins are expressed in safflower. Safflower transformation 
has been described by Baker and Dyer (Plant Cell Rep. (1996) 16: 106-110). 

Monocotelydenous plant species may now also be transformed using a 
variety of methodologies including particle bombardment (Christou et al. (1991) 

30 Biotechn. 9: 957-962; Weeks et al. Plant Physiol. (1993) 102: 1 077-1 Q84; 
Gordon-Kamm et al. Plant Cell (1990) 2: 603-618) PEG mediated DNA uptake 
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(EP 0 292 435; 0 392 225) or Agrobacterium-medlated transformation (Goto- 
Fumiyukt et al (1999) Nature-Biotech. 17 (3):282-286). 

Plastid transformation is described in US Patents 5,451,513; 5,545,817 
and 5,545,818; and PCT Patent Applications 95/16783; 98/1 1235 and 
5 00/39313) Basic chloroplast transformation involves the introduction of cloned 
plastid DNA flanking a selectable marker together with the nucleic acid sequence 
of interest into a suitable target tissue using for example biolistics or protoplast 
transformation. Selectable markers that may be used include for example the 
bacterial aadA gene {Svab et al. (1993) Proc. Natl. Acad. Sci. USA 90; 913- 
10 917). Plastid promoters that may be used include for example the tobacco cIpP 
gene promoter {PCT Patent Application 97/06250). 

In another embodiment, the invention chimeric nucleic acid constructs 
provided herein are directly transformed into the plastid genome. Plastid 
transformation technology is described extensively in U.S. Patent Nos. 
15 5,451,513, 5,545,817, 5,545,818 and 5,576,198; in PCT application nos. WO 
95/16783 and WO 97/32977; and in McBride et. al., Proc Natl Acad Sci USA 
91: 7301-7305 (1994), the entire disclosures of all of which are hereby 
incorporated by reference. In one embodiment, plastid transformation is 
achieved via biolistics, first carried out in the unicellular green alga 
20 Chlamydomonas reinhardtii (Boynton et al. (1988) Science 240:1 534-1 537)) and 
then extended to Nicotiana tabacum (Svab et al. (1 990) Proc Natl Acad Sci USA 
87:8526-8530), combined with selection for cis-acting antibiotic resistance loci 
(spectinomycin or streptomycin resistance) or complementation of non- 
photosynthetic mutant phenotypes. 
25 In another embodiment, tobacco plastid transformation is carried out by 

particle bombardment of leaf or callus tissue, or polyethylene glycol (PEG)- 
mediated uptake of plasmid DNA by protoplasts, using cloned plastid DNA 
flanking a selectable antibiotic resistance marker. For example, 1 to 1 .5 kb 
flanking regions, termed targeting sequences, facilitate homologous 
30 recombination with the plastid genome and allow the replacement or * 

modification of specific regions of the 156 kb tobacco plastid genome. In one 
embodiment, point mutations in the plastid 16S rDNA and rps12 genes 
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conferring resistance to spectinomycin and/or streptomycin can be utilized as 
selectable markers for transformation (Svab et aL (1990) Proc Natl Acad Sci 
USA 87:8526-8530; Staub et aL (1992) Plant Ce// 4:39-45, the entire 
disclosures of which are hereby incorporated by reference), resulting in stable 
5 homoplasmic transformants at a frequency of approximately one per 100 
bombardments of target leaves. The presence of cloning sites between these 
markers allows creation of a plastid targeting vector for introduction of foreign 
genes (Staub et aL (1993) EMBO J 12:601-606, the entire disclosure of which is 
hereby incorporated by reference). In another embodiment, substantial increases 

10 in transformation frequency can be obtained by replacement of the recessive 
rRNA or r-protein antibiotic resistance genes with a dominant selectable marker, 
the bacterial aadA gene encoding the spectinomycin-detoxifying enzyme 
aminoglycoside-3'-adenyltransferase {Svab et aL (1993) Proc Natl Acad Sci USA 
90: 913-917, the entire disclosure of which is hereby incorporated by reference). 

15 This marker has also been used successfully for high-frequency transformation 
of the plastid genome of the green alga Chlamydomonas reinhardtii 
(Goldschmidt-Clermont, M. (1991) Nucl Acids Res 19, 4083-4089, the entire 
disclosure of which is hereby incorporated by reference). In other embodiments, 
plastid transformation of protoplasts from tobacco and the moss Physcomitrella 

20 can be attained using PEG-mediated DNA uptake (O'Neill et aL (1993) Plant J 
3:729-738; Koop et aL (1996) Planta 199:193-201, the entire disclosures of 
which are hereby incorporated by reference). 

Both particle bombardment and protoplast transformation are also 
contemplated for use herein. Plastid transformation of oilseed plants has been 

25 successfully carried out in the genera Arabidopsis and Brassica (Sikdar et al. 
(1998) Plant Cell Rep 18:20-24; PCT Application WO 00/39313, the entire 
disclosures of which are hereby incorporated by reference). 

A chimeric nucleic sequence construct is inserted into a plastid 
expression cassette including a promoter capable of expressing the construct in 

30 plant plastids. A particular promoter capable of expression in a plant plastid is, 
for example, a promoter isolated from the 5' flanking region upstream of the 
coding region of a plastid gene, which may come from the same or a different 
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species, and the native product of which is typically found in a majority of 
plastid types including those present in non-green tissues. Gene expression in 
plastids differs from nuclear gene expression and is related to gene expression in 
prokaryotes (Stern et a/. (1 997) Trends in Plant Sci 2:308-31 5, the entire 
5 disclosure of which is hereby incorporated by reference). 

Plastid promoters generally contain the -35 and -10 elements typical of 
prokaryotic promoters, and some plastid promoters called PEP (plastid-encoded 
RNA polymerase) promoters are recognized by an E. co//-like RNA polymerase 
mostly encoded in the plastid genome, while other plastid promoters called NEP 

10 promoters are recognized by a nuclear-encoded RNA polymerase. Both types of 
plastid promoters are suitable for use herein. Examples of plastid promoters 
include promoters of cIpP genes such as the tobacco cIpP gene promoter (WO 
97/06250, the entire disclosure of which is hereby incorporated by reference) 
and the Arabidopsis cIpP gene promoter (U.S. Application No. 09/038,878, the 

15 entire disclosure of which is hereby incorporated by reference). Another 

promoter capable of driving expression of a chimeric nucleic acid construct in 
plant plastids comes from the regulatory region of the plastid 1 6S ribosomar RNA 
operon (Harris etal., (1994) Microbiol Rev 58:700-754; Shinozaki etaL (1986) 
EMBO J 5:2043-2049, the entire disclosures of both of which are hereby 

20 incorporated by reference). Other examples of promoters capable of driving 

expression of a nucleic acid construct in plant plastids include a psbA promoter 
or am rbcL promoter. A plastid expression cassette preferably further includes a 
plastid gene 3' untranslated sequence (3' UTR) operatively linked to a chimeric 
nucleic acid construct of the present invention. The role of untranslated 

25 sequences is preferably to direct the 3' processing of the transcribed RNA rather 
than termination of transcription. An exemplary 3' UTR is a plastid rps16 gene 
3' untranslated sequence, or the Arabidopsis plastid psbA gene 3' untranslated 
sequence. In a further embodiment, a plastid expression cassette includes a 
poly-G tract instead of a 3' untranslated sequence. A plastid expression 

30 cassette also preferably further includes a 5' untranslated sequence (5' UTR) 

functional in plant plastids, operatively linked to a chimeric nucleic acid construct 
provided herein. 
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A plastid expression cassette is contained in a plastid transformation 
vector, which preferably further includes flanking regions for integration into the 
plastid genome by homologous recombination. The plastid transformation vector 
♦ may optionally include at least one plastid origin of replication. The present 
5 invention also encompasses a plant plastid transformed with such a plastid 

transformation vector, wherein the chimeric nucleic acid construct is expressible 
in the plant plastid. Also encompassed herein is a plant or plant cell, including 
the progeny thereof, including this plant plastid. In a particular embodiment, the 
plant or plant cell, including the progeny thereof, is homoplasmic for transgenic 
10 plastids. 

Other promoters capable of driving expression of a chimeric nucleic acid 
construct in plant plastids include transactivator-regulated promoters, preferably 
heterologous with respect to the plant or to the subcellular organelle or 
component of the plant cell in which expression is effected. In these cases, the 

15 DNA molecule encoding the transact ivator is inserted into an appropriate nuclear 
expression cassette which is transformed into the plant nuclear DNA. The 
transactivator is targeted to plastids using a plastid transit peptide. The 
transactivator and the transactivator-driven DNA molecule are brought together 
either by crossing a selected plastid-transformed line with and a transgenic line 

20 containing a DNA molecule encoding the transactivator supplemented with a 
plastid-targeting sequence and operably linked to a nuclear promoter, or by 
directly transforming a plastid transformation vector containing the desired DNA 
molecule into a transgenic line containing a chimeric nucleic acid construct 
encoding the transactivator supplemented with a plastid-targeting sequence 

25 operably linked to a nuclear promoter. If the nuclear promoter is an inducible 
promoter, in particular a chemically inducible embodiment, expression of the 
chimeric nucleic acid construct in the plastids of plants is activated by foliar 
application of a chemical inducer. Such an inducible transactivator-mediated 
plastid expression system is preferably tightly regulatable, with no detectable 

30 expression prior to induction and exceptionally high expression and acpumulation 
of protein following induction. 
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A particular transactivator is, for example, viral RNA polymerase. 
Particular promoters of this type are promoters recognized by a single sub-unit 
RNA polymerase, such as the T7 gene 10 promoter, which is recognized by the 
bacteriophage T7 DNA-dependent RNA polymerase. The gene encoding the T7 
5 polymerase is preferably transformed into the nuclear genome and the T7 

polymerase is targeted to the plastids using a plastid transit peptide. Promoters 
suitable for nuclear expression of a gene, for example a gene encoding a viral 
RNA polymerase such as the T7 polymerase, are described above and elsewhere 
in this application. Expression of chimeric nucleic acid constructs in plastids can 

10 be constitutive or can be inducible, and such plastid expression can be also 
organ- or tissue-specific. Examples of various expression systems are 
extensively described in WO 98/1 1235, the entire disclosure of which is hereby 
.i incorporated by reference. Thus, in one aspect, the present invention utilizes 
coupled expression in the nuclear genome of a chloroplast-targeted phage T7 

15 RNA polymerase under the control of the chemically inducible PR-1a promoter, 
for example of the PR-1 promoter of tobacco, operably linked with, a chloroplast 
reporter transgene regulated by T7 gene 1 0 promoter/terminator sequences, for 
example as described in as in US Patent No. 5,614,395 the entire disclosure of 
which is hereby incorporated by reference. In another embodiment, when 

20 plastid transformants homoplasmic for the maternally inherited TR or NTR genes 
are pollinated by lines expressing the T7 polymerase in the nucleus, F1 plants 
are obtained that carry both transgene constructs but do not express them until 
synthesis of large amounts of enzymatically active protein in the plastids is 
triggered by foliar application of the PR-1 a inducer compound 

25 benzo(1,2,3)thiadiazole-7-carbothioic acid S-methyl ester (BTH). 

In a particular embodiment, two or more genes, for example TR and NTR 
genes, are transcribed from the plastid genome from a single promoter in an 
operon-like polycistronic gene. In one embodiment, the operon-like polycistronic 
gene includes an intervening DNA sequence between two genes in the operon- 

30 like polycistronic gene. In a particular embodiment, the intervening DNA 
sequence is not present in the plastid genome to avoid homologous 
recombination with plastid sequences. In another embodiment, the DNA 
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sequence is derived from the 5 ' untranslated (UTR) region of a non-eukaryotic 
gene, preferably from a viral 5 'UTR, preferably from a 5 'UTR derived from a 
bacterial phage, such as a T7, T3 or SP6 phage. In one embodiment, a portion 
of the DNA sequence may be modified to prevent the formation of RNA 
5 secondary structures in an RNA transcript of the operon-like polycistronic gene, 
for example between the DNA sequence and the RBS of the downstream gene. 
Such secondary structures may inhibit or repress the expression of the 
downstream gene, particularly the initiation of translation. Such RNA secondary 
structures are predicted by determining their melting temperatures using 
10 computer models and programs such a the "mfold" program version 3 (availabl 
from Zuker and Turner, Washington University School of Medicine, St-Louis, 
MO) and other methods known to one skilled in the art. 

The presence of the intervening DNA sequence in the operon-like 
polycistronic gene increases the accessibility of the RBS of the downstream 
15 gene, thus resulting in higher rates of expression. Such strategy is applicable to 
any two or more genes to be transcribed from the plastid genome from a single 
promoter in an operon-like chimeric heteromultimeric gene. 

Following transformation the cells are grown, typically in a selective 
medium allowing the identification of transformants. Cells may be harvested in 
20 accordance with methodologies known to the art. In order to associate the oil 
bodies with the first and/or second recombinant polypeptides, multimeric-protein- 
complexes, heteromultimeric-protein-complexes, multimeric-fusion-proteins, 
heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypeptide- 
chains, redox-fusion-polypeptides, and a first and/or second thioredoxin-related 
25 protein, the integrity of cells may be disrupted using any physical, chemical or 
biological methodology capable of disrupting the cells' integrity. These 
methodologies are generally cell-type dependent and known to the skilled 
artisan. Where plants are employed they may be regenerated into mature plants 
using plant tissue culture techniques generally known to the skilled artisan. 
30 Seeds may be harvested from mature transformed plants and used to propagate 
the plant line. Plants may also be crossed and in this manner, contemplated 
herein is the breeding of cells lines and transgenic plants that vary in genetic 
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background. It is also possible to cross a plant line comprising the first 
recombinant polypeptide with a plant line comprising the second recombinant 
polypeptide. Accordingly, also provided herein are methods of producing in a 
plant a recombinant multimeric-protein-complex, said method comprising: 
5 (a) preparing a first plant comprising cells, said cells comprising oil bodies and a 
first recombinant polypeptide, such as a redox protein (e.g., a thioredoxin-related 
protein, and the like) or an immunoglobulin-polypeptide-chain, wherein said first 
recombinant polypeptide is capable of associating with said oil bodies through an 
oil-body-targeting-protein; 
10 (b) preparing a second plant comprising cells, said cells comprising oil bodies and 
a second recombinant polypeptide, such as a second redox protein (e.g., a 
thioredoxin-related protein, and the like) or a second immunoglobulin- 
polypeptide-chain; and 

(c) sexually crossing said first plant with said second plant to produce a progeny 
15 plant comprising cells, said cells comprising oil bodies, wherein said oil bodies 
are capable of associating with said first recombinant polypeptide, and said first 
recombinant recombinant polypeptide is capable of associating with said second 
recombinant polypeptide to form said recombinant multimeric-protein-complex. 
The second recombinant polypeptide may also associate with the oil 
20 bodies. Accordingly, also provided herein are methods of producing in a plant a 
recombinant multimeric-protein-complex, said method comprising: 

(a) preparing a first plant comprising cells, said cells comprising oil bodies and a 
first recombinant polypeptide, such as a redox (or thioredoxin-related) protein or 
immunoglobulin-polypeptide-chain, wherein said first recombinant polypeptide is 

25 capable of associating with said oil bodies through an oil-body-targeting-protein; 

(b) preparing a second plant comprising cells, said cells comprising oil bodies and 
a second recombinant polypeptide, such as a second redox (thioredoxin-related) 
protein or a second immunoglobulin-polypeptide-chain, wherein said second 
recombinant polypeptide is capable of associating with said oil bodies through an 

30 oil body targeting protein; and 

(c) sexually crossing said first plant with said second plant to produce a progeny 
plant comprising cells, said cells comprising oil bodies, wherein said oil bodies 
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are capable of associating with said first recombinant polypeptide, and said first 
recombinant recombinant polypeptide is capable of associating with said second 
recombinant polypeptide to form said recombinant multimeric-protein-complex. 
Isolation of Oil bodies 
5 The oil bodies provided herein may be obtained from any cell containing 

oil bodies, including any animal cell; plant cell; fungal cell; for example a yeast 
cell, algae cell; or bacterial cell. Any process suitable for the isolation oil bodies 
from cells may be used herein. Processes for the isolation of oil bodies from 
plant seed cells have been described in US Patents (6,146,645 and 6,183,762) 

10 and the isolation of oil bodies from yeast cells has been described by Ting et al. 
(1997) J. Biol. Chem. 272: 3699-3706). 

In certain embodiments, the oil bodies are obtained from a plant cell such 
as for example a pollen cell; a fruit cell; a spore cell; a nut cell; mesocarp cell; for 
example the mesocarp cells obtainable from olive (Olea europaea) or avocado 

15 (Persea americana); or a seed cell. In particular embodiments the oil bodies are 
obtained from a plant seed cell. The seeds can be obtained from a transgenic 
plant according to the present invention. In particular embodiments, a seed of a 
transgenic plant according to the present invention contains the first and/or 
second recombinant polypeptides, multimeric-protein-complexes, 

20 heteromultimeric-protein-complexes, multimeric-fusion-proteins, 

heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypeptide- 
chains, redox-fusion-polypeptides, or first and/or second thioredoxin-related 
proteins in a concentration of at least about 0.5% of total cellular seed protein. 
In further embodiments, a seed of a transgenic plant provided herein contains a 

25 recombinant polypeptide or multimeric-protein-complex in a concentration of at 
least about 0.5%, 0.6%, 0.7%, 0.8%, 0.9%, 1.0%, 1.25%, 1.5%, 1.75%, 
2.0%, 2.25%, 2.5%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10% or more, of total 
cellular seed protein. The upper limits of the recombinant polypeptide or 
multimeric-protein-complex concentration can be up to about 8%, 9%, 10%, 

30 11%, 12%, 13%, 14%, 15%. Thus, the ranges at least about 0.5%. up to 

about 1 5%; at least about 1.0% up to about 10%; and at least about 5% up to 
about 8% are among the various ranges contemplated herein. 
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Among the plant seeds useful in this regard are plant seeds obtainable 
from the group of plant species consisting of Brazil nut (Betholletia excelsa); 
castor (Riccinus communis); coconut (Cocus nucifera); coriander (Coriandrum 
sativum)', cotton (Gossypium spp.); groundnut (Arachis hypogaea); jojoba 
5 (Simmondsia chinensis); linseed/flax (Linum usitatissimum); maize {Zea mays); 
mustard (Brassica spp. and Sinapis alba); oil palm (E/aeis guineeis); olive [Olea 
europaea); rapeseed (Brassica spp.); safflower (Carthamus tinctorius); soybean 
(Glycine max); squash (Cucurbita maxima); sunflower (Helianthus annuus); 
barley (Hordeum vu/gare); wheat (Traeticum aestivum) and mixtures thereof. In 

10 a particular embodiment, oil bodies are obtainable from the seeds obtainable 
from safflower (Carthamus tinctorius). 

In order to prepare oil bodies from plant seeds, plants are grown and 
allowed to set seed in accordance with common agricultural practices. Thus, 
the present invention also provides seeds comprising oil bodies, wherein said oil 

15 bodies further comprise invention multimeric-protein-complexes described herein. 
Upon harvesting the seed and, if necessary the removal of large insoluble 
materials such as stones or seed hulls, by for example sieving or rinsing, any 
process suitable for the isolation of oil bodies from seeds may be used herein. A 
typical process involves grinding of the seeds followed by an aqueous extraction 

20 process. 

Seed grinding may be accomplished by any comminuting process 
resulting in a substantial disruption of the seed cell membrane and cell walls 
without compromising the structural integrity of the oil bodies present in the 
seed cell. Suitable grinding processes in this regard include mechanical pressing 

25 and milling of the seed. Wet milling processes such as described for cotton 
(Lawhon et al. (1977) J. Am. Oil Chem. Soc. 63: 533-534) and soybean (US 
Patent 3,971,856; Carter et al. (1974) J. Am. Oil Chem. Soc. 51: 137-141) are 
particularly useful in this regard. Suitable milling equipment capable of industrial 
scale seed milling include colloid mills, disc mills, pin mills, orbital mills, IKA mills 

30 and industrial scale homogenizers. The selection of the milling equipment will 
depend on the seed, which is selected, as well as the throughput requirement. 
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Solid contaminants such as seed hulls, fibrous materials, undissolved 
carbohydrates, proteins and other insoluble contaminants are subsequently 
preferably removed from the ground seed fraction using size exclusion based 
methodologies such as filtering or gravitational based methods such as a 
5 centrifugation based separation process. Centrifugation may be accomplished 
using for example a decantation centrifuge such as a HASCO 200 2-phase 
decantation centrifuge or an NX310B (Alpha Laval). Operating conditions are 
selected such that a substantial portion of the insoluble contaminants and 
sediments and may be separated from the soluble fraction. 

10 Following the removal of insolubles the oil body fraction may be 

separated from the aqueous fraction. Gravitational based methods as well as 
size exclusion based technologies may be used. Gravitational based methods 
that may be used include centrifugation using for example a tubular bowl 
centrifuge such as a Sharpies AS-16 or AS-46 (Alpha Laval), a disc stack 

1 5 centrifuge or a hydrocyclone, or separation of the phases under natural 

gravitation. Size exclusion methodologies that may be used include membrane 
ultra filtration and crossflow microfiltration. 

Separation of solids and separation of the oil body phase from the 
aqueous phase may also be carried out concomitantly using gravity based 

20 separation methods or size exclusion based methods. 

The oil body preparations obtained at this stage in the process are 
generally relatively crude and depending on the application of the oil bodies, it 
may be desirable to remove additional contaminants. Any process capable of 
removing additional seed contaminants may be used in this regard. Conveniently 

25 the removal of these contaminants from the oil body preparation may be 

accomplished by resuspending the oil body preparation in an aqueous phase and 
re-centrifuging the resuspended fraction, a process referred to herein as 
"washing the oil bodies". The washing conditions selected may vary depending 
on the desired purity of the oil body fractions. For example where oil bodies are 

30 used in pharmaceutical compositions, generally a higher degree of purity may be 
desirable than when the oil bodies are used in food preparations. The oil bodies 
may be washed one or more times depending on the desired purity and the ionic 
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strength, pH and temperature may all be varied. Analytical techniques may be 
used to monitor the removal of contaminants. For example SDS gel 
electrophoresis may be employed to monitor the removal of seed proteins. 

The entire oil body isolation process may be performed in a batch wise 
5 fashion or continuous flow. In a particular embodiment, industrial scale 
continuous flow processes are utilized. 

Through the application of these and similar techniques the skilled artisan 
is able to obtain oil bodies from any cell comprising oil bodies. The skilled 
artisan will recognize that generally the process will vary somewhat depending 

10 on the cell type that is selected. However, such variations may be made 
without departing from the scope and spirit of the present invention. 
Association of the first and/or second recombinant polypeptides, multimeric- 
protein-complexes, heteromultimeric-protein-complexes, multimeric-fusion- 
proteins, heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin- 

1 5 polypeptide-chains, redox-fusion-polypeptides, the first and/or second 
• thioredoxin-related proteins with oii bodies. 

In accordance with the present invention, the oil bodies are associated 
with either the first and/or second recombinant polypeptides, multimeric-protein- 
complexes, heteromultimeric-protein-complexes, multimeric-fusion-proteins, 

20 heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypeptide- 
chains, redox-fusion-polypeptides, the first and/or second thioredoxin-related 
proteins through association with an oil-body-targeting-protein capable of 
association with these multimeric-protein-complexes and the oil bodies. As used 
herein the phrase "associating the oil bodies with the multimeric-protein- 

25 complex" means that the oil bodies are brought in proximity of the multimeric- 
protein-complexes in a manner that allows the association of the oil bodies with 
either the first and/or second recombinant polypeptides, multimeric-prptein- 
complexes, heteromultimeric-protein-complexes, multimeric-fusion-proteins, 
heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypeptide- 

30 chains, redox-fusion-polypeptides, or the first and/or second thioredoxin-related 
proteins. The association of the oil bodies with the multimeric-protein- 
complexes is accomplished by association of the oil-body-targeting-protein with 
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both the oil body and with the multimeric-protein-complex. In particular 
embodiments, the cells expressing the multimeric-protein-complex associate with 
the oil bodies that are obtainable from these same cells, which permits the 
convenient production and isolation of the multimeric-protein-complex, including 
5 the first and/or second recombinant polypeptides, heteromultimeric-protein- 
complexes, multimeric-fusion-proteins, heteromultimeric-fusion-proteins, 
immunoglobulins, immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, 
or the first and/or second thioredoxin-related proteins, in an oil body-comprising 
host cell system. Accordingly, in one embodiment, the association of the oil 

10 body with the multimeric-protein-complex is accomplished fntracellularly during 
the growth of the cell. For example, a redox fusion polypeptide may be fused to 
an oil-body-protein and the chimeric protein may be expressed in oil body- 
containing plant seeds. Isolation of the oil bodies from the seeds in this case 
results in isolation of oil bodies comprising either the first and/or second 

1 5 recombinant polypeptides, multimeric-protein-complexes, heteromultimeric- 

protein-complexes, multimeric-fusion-proteins, heteromultimeric-fusion-proteins, 
immunoglobulins, immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, 
or the first and/or second thioredoxin-related proteins. In another embodiment, 
in which the multimeric-protein-complex associates with oil bodies obtainable 

20 from the same cells in which the complex is produced, the association of the oil 
bodies with the multimeric-protein-complex is accomplished upon disrupting the 
cell's integrity. 

For example, the first and/or second recombinant polypeptides, 
multimeric-protein-complexes, heteromultimeric-protein-complexes, multimeric- 

25 fusion-proteins, heteromultimeric-fusion-proteins, immunoglobulins, 

immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, or the first and/or 
second thioredoxin-related proteins may be expressed in such a manner that it is 
targeted to the endomembrane system of the seed cells. Oil bodies present in 
the same seed cells comprising an oil-body-targeting-protein capable of 

30 association with these multimeric-protein-complexes, for example an qleosin 
linked to a single chain antibody capable of association with a recombinant 
polypeptide or multimeric-protein-complex, may then associate with the 
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recombinant polypeptide or multimeric-protein-complex upon grinding of the 
seed. 

In accordance with this embodiment, plant seed cells comprising a light 
and heavy chain of an immunoglobulin targeted to the plant apoplast can be 
5 prepared. These particular seed cells are prepared to further comprise oil bodies 
associated with an oil-body-targeting-protein capable of association with the 
immunoglobulin, such as for example, an oleosin-protein A fusion protein, and 
the like. Upon grinding of the seed, the oil bodies comprising protein A 
associate with the immunoglobulin through binding. 

10 In yet another embodiment, the oil bodies used to associate with the 

multimeric-protein-complex are obtained from a cellular source different from the 
cell comprising the first and/or second recombinant polypeptides, multimeric- 
protein-complexes, heteromultimeric-protein-complexes, multimeric-fusion- 
proteins, heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin- 

15 polypeptide-chains, redox-fusion-polypeptides, or the first and/or second 

thioredoxin-related proteins, such as from a separate plant line. For example, oil 
bodies associated with protein A may be prepared from one plant line. These oil 
bodies may then be mixed with ground seeds comprising an apoplastically 
expressed light and heavy chain constituting an immunoglobulin. Alternatively, a 

20 plant line comprising oil bodies associated with protein A may be crossed with a 
plant line comprising an immunoglobulin. 

The first recombinant polypeptide, second recombinant polypeptide and 
oil-body-targeting-protein may also be prepared in separate cellular 
compartments. Association of the first polypeptide, second polypeptide, and oil 

25 body then may occur upon disruption of the cell's integrity. For example, 
various mechanisms for targeting gene products are known to exist in plants, 
and the sequences controlling the functioning of these mechanisms have been 
characterized in some detail. For example, the targeting of gene products to the 
chloroplast is controlled by a transit sequence found at the amino terminal end of 

30 various proteins which is cleaved during chloroplast import to yield the mature 
protein (Comai etal. ( 1 988) J Biol Chem 263: 15104-15109). Other gene 
products are localized to other organelles such as the mitochondrion and the 
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peroxisome (linger etal. (1989) Plant Mol Biol 13:41 1-418). The cDNAs 
encoding these products can be manipulated to target heterologous gene 
products to these organelles. In addition, sequences have been characterized 
which cause the targeting of gene products to other cell compartments. 
5 Amino terminal sequences are responsible for targeting to the ER, the 

apoplast, and extracellular secretion from aleurone cells (Koehler & Ho (1990) 
Plant Cell 2:769-783). Additionally, amino terminal sequences in conjunction 
with carboxy terminal sequences are responsible for vacuolar targeting of gene 
products (Shinshi etal., (1990) Plant Mol Biol 14:357-368). By the fusion of the 
10 appropriate targeting sequences described above to transgene sequences of 
interest it is possible to direct the transgene product to the desired organelle or 
cell compartment. 

As hereinbefore mentioned, the redox protein obtained using the methods 
provided herein is enzymatically active while associated with the oil body. 

1 5 Preferably the redox protein is at least 5 times more active when produced as a 
redox fusion polypeptide with a second redox protein relative to its production in 
association with an oil body as a non-fusion polypeptide (i.e. without the second 
redox protein). More preferably the redox protein is at least 10 times more 
active when produced as a redox fusion polypeptide. 

20 The activity of the redox fusion polypeptide may be determined in 

accordance with methodologies generally known to the art (see for example: 
Johnson et al (1984) J. of Bact. Vol. 158 3:1061-1069) and may be optimized 
by for example the addition of detergents, including ionic and non-ionic 
detergents. 

25 Formulation of Oil Bodies 

In accordance with a particular embodiment, the oil bodies comprising the 
first and/or second recombinant polypeptides, multimeric-protein-complexes, 
heteromultimeric-protein-complexes, multimeric-fusion-proteins, 
heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypeptide- 

30 chains, redox-fusion-polypeptides, or the first and/or second thioredoxin-related 
proteins, are preferably formulated into an emulsion. The emulsion is preferably 
used in the preparation of a pharmaceutical composition, personal care or a food 
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product. In emulsified form, the oil body offers certain desirable properties, such 
as for example excellent compatibility with the human skin. 

It particular embodiments, the oil body formulation is stabilized so that a 
final product may be obtained which may be stored and preserved for longer 
5 periods of time. As used herein, the term "stabilized oil body preparation" refers 
to an oil body preparation that is prepared so that the formulation does not 
undergo undesirable physical or chemical alterations when the oil body 
preparation is stored. The stabilization requirements may vary depending on the 
final product. For example personal care products are preferably stable for at 

10 least one year at room temperature while additionally being able to withstand 
short temperature fluctuations. Pharmaceutical formulations may in some cases 
be less stable as they may be stored at lower temperatures thereby preventing 
the occurrence of undesirable reactions. 

In general, stabilization techniques that may be used herein include any 

15 and all methods for the preservation of biological material including the addition 
of chemical agents, temperature modulation based methodologies, radiation- 
based technologies and combinations thereof. In particular embodiments small 
amounts of stabilizing chemical agents are mixed, with the oil body formulation 
to achieve stabilization. These chemical agents include inter alia preservatives, 

20 antioxidants, acids, salts, bases, viscosity modifying agents, emulsifiers, gelling 
agents and mixtures thereof and may all be used to stabilize the oil body 
preparation. In view of the presence of the redox fusion polypeptide the 
stabilizing agent is generally selected to be compatible with and resulting in good 
enzymatic function of the redox fusion polypeptide. 

25 Diagnostic parameters to assess the stability of the oil body preparation 

may be as desired and include all parameters indicative of undesirable qualitative 
or quantitative changes with respect to chemical or physical stability. Typical 
parameters to assess the oil body preparation over time include color, odor, 
viscosity, texture, pH and microbial growth, and enzymatic activity. 

30 In particular embodiments, the oil body formulation is stabilized prior to 

the addition of further ingredients that may be used to prepare the final product. 
Howevera, in other embodiments, it is nevertheless possible to formulate the 
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final formulation using non-stabilized oil bodies and stabilize the final formulation. 

The final preparations may be obtained using one or more additional 
ingredients and any formulation process suitable for the preparation of a 
formulation comprising oil bodies. Ingredients and processes employed will 
5 generally vary depending on the desired use of the final product, will be art 
recognized and may be as desired. Ingredients and processes that may be used 
herein include those described in US Patents (US Patents 6,146,645 and 
6,183,762) which are incorporated by reference herein. 

In particular embodiments, the redox fusion polypeptide comprises a 

10 thioredoxin and a thioredoxin-reductase. Accordingly, provided herein are oil 
bodies comprising a thioredoxin/thioredoxin-reductase fusion polypeptide. Also 
provided herein is a formulation containing oil bodies comprising a 
thioredoxin/thioredoxin-reductase fusion capable of treating or protecting a target 
against oxidative stress. The stress of the target is treated or prevented by 

15 contacting the target with the formulation. The target may be any substance 
susceptible to oxidative stress, including any molecule, molecular complex, cell, 
tissue or organ. 

In another embodiment, provided herein is a formulation containing oil 
bodies comprising a thioredoxin/thioredoxin-reductase fusion capable of 

20 chemically reducing a target. Contacting the target with the formulation reduces 
the target. The target may be any substance susceptible to reduction, including 
any molecule or molecular complex. Particularly susceptible targets in this 
regard are the disulfide bonds present in proteins. 

The oil bodies comprising thioredoxin/thioredoxin-reductase may be used 

25 to prepare formulations used to reduce the allergenicity of food or increase. the 
digestibility of food. Preferably, the method of reducing the food allergenicity is 
practiced by mixing the thioredoxin/thioredoxin-reductase comprising oil bodies 
with food or food ingredients selected from a variety of sources including for 
example wheat flour, wheat dough, milk, cheese, soya, yogurt and ice cream. 

30 The thioredoxin/thioredoxin-reductase comprising oil bodies may also be used to 
increase the digestibility of milk as well as other disulfide containing proteins 
(Jiao, J. et al. (1992) J. Agric. Food Chem 40: 2333-2336), Further food 
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applications include the use of the oil thioredoxin/thioredoxin-reductase 
comprising oil bodies as a food additive to enhance dough strength and bread 
quality properties (Wong etal., (1993) J. Cereal Chem. 70: 113-114; Kobrehel 
et al. (1994) Gluten Proteins: Association of Cereal Research; Detmold, 
5 Germany). 

Also provided herein are pharmaceutical compositions comprising, in a 
pharmaceutically active carrier: oil bodies comprising a thioredoxin/thioredoxin- 
reductase; oil bodies comprising multimeric-protein-complexes, such as 
heteromultimeric-protein-complexes; isolated thioredoxin/thioredoxin-reductase 

10 fusion proteins; or isolated multimeric-protein-complexes. These pharmaceutical 
compositions may be used for the treatment of reperfusion injury (Aota et al. 
(1996) J. Cardiov. Pharmacol. (1996) 27: 727-732), cataracts (US Patent US 
4,771,036), chronic obstructive pulmonary disease (COPD) (MacNee et al. 
(1999) Am. J. Respir. Crit. Care Med. 160:S58-S65), diabetes (Hotta et al. J. 

15 Exp. Med. 188: 1445-1451), envenomation (PCT Patent Application 99/20122; 
US Patent 5,792,506), bronchopulmonary disease (MacNee (2000) Chest 
117:3035-3175); malignancies (PCT Patent Application 91/04320) and the 
alleviation of the allergenic potential of airborne, for example pollen- derived, and 
contact allergens (PCT Patent Application 00/44781). Other diseases or 

20 conditions that may be treated with the pharmaceutical compositions provided 
herein include: psoriasis, wound healing, sepsis, Gl bleeding, intestinal bowel 
disease (IBD), ulcers, transplantation, GERD (gastro esophageal reflux disease). 

In another embodiment, the pharmaceutical compositions provided herein, 
particularly those comprising one or more redox proteins alone or in combination 

25 with oil bodies, can be used in the treatment of inflammatory and viral diseases 
by reductively inactivating phospholipase A2, one of the contributing factors in 
inflammatory diseases. Additionally, the redox fusion polypeptide system has 
been found to function as a self-defense mechanism in response to 
environmental stimuli, including oxidative stress caused by UV-generated free 

30 radicals. Consequently, redox proteins, e.g., oleosin-thioredoxin, oleosin- 

thioredoxin-reductase, the various redox fusion polypeptides described herein, 
provide beneficial effects in certain skin conditions such as psoriasis, skin 
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cancer, dandruff, diaper rash, dermatitis, acne, sun damage, aging, 
inflammation, and the like. 

In another embodiment, oil-body-thioredoxin-related fusion proteins, e.g., 
oleosin-Thioredoxin-reductase, can also be used as a venom antidote. Many 
5 animal venoms and other toxins contain disulfide bonds, including all snake 
venom neurotoxins, some bacterial neurotoxins including tetanus and botulinum 
A, bee venom phospholipase A 2 , and scorpion venom. In a further embodiment, 
the redox protein related pharmaceutical compositions provided herein can be 
used to inactivate venom toxins by reduction of disulfide bonds. A method of 

10 treating an individual suffering from the effects of a venom or toxin can include 
the step of administering an effective dose of a pharmaceutical composition, in a 
pharmaceutical^ effective carrier in an amount sufficient to relieve or reverse the 
effects of the venom toxin on the individual. 

The pharmaceutical compositions provided herein are preferably 

15 formulated for single dosage administration. The concentrations of the 

compounds in the formulations are effective for delivery of an amount, upon 
administration, that is effective for the intended treatment. Typically, the 
compositions are formulated for single dosage administration. To formulate a 
composition, the weight fraction of a compound or mixture thereof is dissolved, 

20 suspended, dispersed or otherwise mixed in a selected vehicle at an effective 
concentration such that the treated condition is relieved or ameliorated. 
Pharmaceutical carriers or vehicles suitable for administration of the compounds 
provided herein include any such carriers known to those skilled in the art to be 
suitable for the particular mode of administration. 

25 In addition, the compounds may be formulated as the sole 

pharmaceutically active ingredient in the composition or may be combined with 
other active ingredients. Liposomal suspensions, including tissue-targeted 
liposomes, may also be suitable as pharmaceutically acceptable carriers. These 
may be prepared according to methods known to those skilled in the art. For 

30 example, liposome formulations may be prepared as described in U.S. Patent No. 
4,522,811. 
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The active compound is included in the pharmaceutical^ acceptable 
carrier in an amount sufficient to exert a therapeutically useful effect in the 
absence of undesirable side effects on the patient treated. The therapeutically 
effective concentration may be determined empirically by testing the compounds 
5 in known in vitro and in vivo systems, such as the assays provided herein. 

The concentration of active compound in the drug composition will 
depend on absorption, inactivation and excretion rates of the active compound, 
the physicochemical characteristics of the compound, the dosage schedule, and 
amount administered as well as other factors known to those of skill in the art. 

10 Typically a therapeutically effective dosage is contemplated. The 

amounts administered may be on the order of 0.001 to 1 mg/ml, preferably 
about 0.005-0.05 mg/ml, more preferably about 0.01 mg/ml, of blood volume. 
Pharmaceutical dosage unit forms are prepared to provide from about 1 mg to 
about 1000 mg and preferably from about 10 to about 500 mg, more preferably 

15 about 25-75 mg of the essential active ingredient or a combination of essential 
ingredients per dosage unit form. The precise dosage can be empirically 
determined. 

The active ingredient may be administered at once, or may be divided into 
a number of smaller doses to be administered at intervals of time. It is 

20 understood that the precise dosage and duration of treatment is a function of the 
disease being treated and may be determined empirically using known testing 
protocols or by extrapolation from ]n vivo or in vitro test data. It is to be noted 
that concentrations and dosage values may also vary with the severity of the 
condition to be alleviated. It is to be further understood that for any particular 

25 subject, specific dosage regimens should be adjusted over time according to the 
individual need and the professional judgment of the person administering or 
supervising the administration of the compositions, and that the concentration 
ranges set forth herein are exemplary only and are not intended to limit the 
scope or use of the claimed compositions and combinations containing them. 

30 Preferred pharmaceutically acceptable derivatives include acids, salts, 

esters, hydrates, solvates and prodrug forms. The derivative is typically selected 
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such that its pharmacokinetic properties are superior to the corresponding 
neutral compound. 

Thus, effective concentrations or amounts of one or more of the 
compounds provided herein or pharmaceutically acceptable derivatives thereof 
5 are mixed with a suitable pharmaceutical carrier or vehicle for systemic, topical 
or local administration to form pharmaceutical compositions. Compounds are 
included in an amount effective for ameliorating or treating the disorder for 
which treatment is contemplated. The concentration of active compound in the 
composition will depend on absorption, inactivation, excretion rates of the active 

10 compound, the dosage schedule, amount administered, particular formulation as 
well as other factors known to those of skill in the art. 

Solutions or suspensions used for parenteral, intradermal, subcutaneous, 
or topical application can include any of the following components: a sterile 
diluent, such as water for injection, saline solution, fixed oil, polyethylene glycol, 

15 glycerine, propylene glycol or other synthetic solvent; antimicrobial agents, such 
as benzyl alcohol and methyl parabens; antioxidants, such as ascorbic acid and 
sodium bisulfite; chelating agents, such as ethylenediaminetetraacetic acid 
(EDTA); buffers, such as acetates, citrates and phosphates; and agents for the 
adjustment of tonicity such as sodium chloride or dextrose. Parenteral 

20 preparations can be enclosed in ampules, disposable syringes or single or 
multiple dose vials made of glass, plastic or other suitable material. 

In instances in which the compounds exhibit insufficient solubility, 
methods for solubilizing compounds may be used. Such methods are known to 
those of skill in this art, and include, but are not limited to, using cosolvents, 

25 such as dimethylsulfoxide (DMSO), using surfactants, such as Tween®, or 

dissolution in aqueous sodium bicarbonate. Derivatives of the compounds, such 
as prodrugs of the compounds may also be used in formulating effective 
pharmaceutical compositions. For ophthalmic indications, the compositions are 
formulated in an ophthalmically acceptable carrier. For the ophthalmic uses 

30 herein, local administration, either by topical administration or by injection is 
preferred. Time release formulations are also desirable. Typically, the 
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compositions are formulated for single dosage administration, so that a single 
dose administers an effective amount. 

Upon mixing or addition of the compound with the vehicle, the resulting 
mixture may be a solution, suspension, emulsion or other composition. The form 
5 of the resulting mixture depends upon a number of factors, including the 
intended mode of administration and the solubility of the compound in the 
selected carrier or vehicle. If necessary, pharmaceutical^ acceptable salts or 
other derivatives of the compounds are prepared. 

The compound is included in the pharmaceutical^ acceptable carrier in an 

10 amount sufficient to exert a therapeutically useful effect in the absence of 

undesirable side effects on the patient treated. It is understood that number and 
degree of side effects depends upon the condition for which the compounds are 
administered. For example, certain toxic and undesirable side effects are 
tolerated when treating life-threatening illnesses that would not be tolerated 

15 when treating disorders of lesser consequence. 

The compounds can also be mixed with other active materials, that do 
not impair the desired action, or with materials that supplement the desired 
action known to those of skill in the art. The formulations of the compounds 
• and agents for use herein include those suitable for oral, rectal, topical, 

20 inhalational, buccal {e.g., sublingual), parenteral {e.g., subcutaneous, 

intramuscular, intradermal, or intravenous), transdermal administration or any 
route. The most suitable route in any given case will depend on the nature and 
severity of the condition being treated and on the nature of the particular active 
compound which is being used. The formulations are provided for administration 

25 to humans and animals in unit dosage forms, such as tablets, capsules, pills, 

powders, granules, sterile parenteral solutions or suspensions, and oral solutions 
or suspensions, and oil-water emulsions containing suitable quantities of the 
compounds or pharmaceutical^ acceptable derivatives thereof. The 
pharmaceutically therapeutically active compounds and derivatives thereof are 

30 typically formulated and administered in unit-dosage forms or multiple-dosage 
forms. Unit-dose forms as used herein refers to physically discrete units suitable 
for human and animal subjects and packaged individually as is known in the art. 
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Each unit-dose contains a predetermined quantity of the therapeutically active 
compound sufficient to produce the desired therapeutic effect, in association 
with the required pharmaceutically acceptable carrier, vehicle or diluent. 
Examples of unit-dose forms include ampoules and syringes and individually 
5 packaged tablets or capsules. Unit-dose forms may be administered in fractions 
or multiples thereof. A multiple-dose form is a plurality of identical unit-dosage 
forms packaged in a single container to be administered in segregated unit-dose 
form. Examples of multiple-dose forms include vials, bottles of tablets or 
capsules or bottles of pints or gallons. Hence, multiple dose form is a multiple of 

10 unit-doses which are not segregated in packaging. 

The composition can contain along with the active ingredient: a diluent 
such as lactose, sucrose, dicalcium phosphate, or carboxymethylcellulose; a 
lubricant, such as magnesium stearate, calcium stearate and talc; and a binder 
such as starch, natural gums, such as gum acaciagelatin, glucose, molasses, 

15 polivinylpyrrolidine, celluloses and derivatives thereof, povidone, crospovidones 
and other such binders known to those of skill in the art. Liquid 
pharmaceutically administrable compositions can, for example, be prepared by 
dissolving, dispersing, or otherwise mixing an active compound as defined above 
and optional pharmaceutical adjuvants in a carrier, such as, for example, water, 

20 saline, aqueous dextrose, glycerol, glycols, ethanol, and the like, to thereby form 
a solution or suspension. If desired, the pharmaceutical composition to be 
administered may also contain minor amounts of nontoxic auxiliary substances 
such as wetting agents, emulsifying agents, or solubilizing agents, pH buffering 
agents and the like, for example, acetate, sodium citrate, cyclodextrine 

25 derivatives, sorbitan monolaurate, triethanolamine sodium acetate, 

triethanolamine oleate, and other such agents. Methods of preparing such 
dosage forms are known, or will be apparent, to those skilled in this art (see, 
e.g., Remington's Pharmaceutical Sciences, Mack Publishing Company, Easton, 
Pa., 15th Edition, 1975). The composition or formulation to be administered will 

30 contain a quantity of the active compound in an amount sufficient to- alleviate 
the symptoms of the treated subject. 
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Dosage forms or compositions containing active ingredient in the range of 
0.005% to 100% with the balance made up from non-toxic carrier may be 
prepared. For oral administration, the pharmaceutical compositions may take the 
form of, for example, tablets or capsules prepared by conventional means with 
5 pharmaceutical^ acceptable excipients such as binding agents (e.g., 
pregelatinized maize starch, polyvinyl pyrrolidone or hydroxypropyl 
methylcellulose); fillers [e.g., lactose, microcrystalline cellulose or calcium 
hydrogen phosphate); lubricants (e.g., magnesium stearate, talc or silica); 
disintegrants (e.g., potato starch or sodium starch glycolate); or wetting agents 
10 (e.g., sodium lauryl sulphate). The tablets may be coated by methods well- 
known in the art. 

The pharmaceutical preparation may also be in liquid form, for example, 
solutions, syrups or suspensions, or may be presented as a drug product for 
reconstitution with water or other suitable vehicle before use. Such liquid 

15 preparations may be prepared by conventional means with pharmaceutical^ 
acceptable additives such as suspending agents (e.g., sorbitol syrup, cellulose 
derivatives or hydrogenated edible fats); emulsifying agents (e.g., lecithin or 
acacia); non-aqueous vehicles (e.g., almond oil, oily esters, or fractionated 
vegetable oils); and preservatives (e.g., methyl or propyl-p-hydroxybenzoates or 

20 sorbic acid). 

Formulations suitable for rectal administration are preferably presented as 
unit dose suppositories. These may be prepared by admixing the active 
compound with one or more conventional solid carriers, for example, cocoa 
butter, and then shaping the resulting mixture. 

25 Formulations suitable for topical application to the skin or to the eye 

preferably take the form of an ointment, cream, lotion, paste, gel, spray, aerosol 
and oil. Carriers which may be used include vaseline, lanoline, polyethylene 
glycols, alcohols, and combinations of two or more thereof. The topical 
formulations may further advantageously contain 0.05 to 1 5 percent by weight 

30 of thickeners selected from among hydroxypropyl methyl cellulose, methyl 
cellulose, polyvinylpyrrolidone, polyvinyl alcohol, poly (alkylene glycols), 
poly/hydroxyalkyl, (meth)acrylates or poly(meth)acrylamides. A topical 
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formulation is often applied by instillation or as an ointment into the conjunctival 
sac. It can also be used for irrigation or lubrication of the eye, facial sinuses, 
and external auditory meatus. It may also be injected into the anterior eye 
chamber and other places. The topical formulations in the liquid state may be 
5 also present in a hydrophilic three-dimensional polymer matrix in the form of a 
strip, contact lens, and the like from which the active components are released. 

For administration by inhalation, the compounds for use herein can be 
delivered in the form of an aerosol spray presentation from pressurized packs or 
a nebulizer, with the use of a suitable propellant, e.g., dichlorodifluoromethane, 

10 trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or other 
suitable gas. In the case of a pressurized aerosol, the dosage unit may be 
determined by providing a valve to deliver a metered amount. Capsules and 
cartridges of, e.g., gelatin, for use in an inhaler or insufflator may be formulated 
containing a powder mix of the compound and a suitable powder base such as 

15 lactose or starch. 

Formulations suitable for buccal (sublingual) administration include, for 
example, lozenges containing the active compound in a flavored base, usually 
sucrose and acacia or tragacanth; and pastilles containing the compound in an 
inert base such as gelatin and glycerin or sucrose and acacia. 

20 The compounds may be formulated for parenteral administration by 

injection, e.g., by bolus injection or continuous infusion! Formulations for 
injection may be presented in unit dosage form, e.g., in ampules or in multi-dose 
containers, with an added preservative. The compositions may be suspensions, 
solutions or emulsions in oily or aqueous vehicles, and may contain formulatory 

25 agents such as suspending, stabilizing and/or dispersing agents. Alternatively, 
the active ingredient may be in powder form for reconstitution with a suitable 
vehicle, e.g., sterile pyrogen-free water or other solvents, before use. 

Formulations suitable for transdermal administration may be presented as 
discrete patches adapted to remain in intimate contact with the epidermis of the 

30 recipient for a prolonged period of time. Such patches suitably contain the 

active compound as an optionally buffered aqueous solution of, for example, 0.1 
to 0.2 M concentration with respect to the active compound. Formulations 
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suitable for transdermal administration may also be delivered by iontophoresis 
{see, e.g., Pharmaceutical Research 3 (6), 318 (1986)) and typically take the 
form of an optionally buffered aqueous solution of the active compound. 

The pharmaceutical compositions may also be administered by controlled 
5 release means and/or delivery devices (see, e.g., in U.S. Patent Nos. 3,536,809; 
3,598,123; 3,630,200; 3,845,770; 3,847,770; 3,916,899; 4,008,719; 
4,687,610; 4,769,027; 5,059,595; 5,073,543; 5,120,548; 5,354,566; 
5,591,767; 5,639,476; 5,674,533 and 5,733,566). 

Desirable blood levels may be maintained by a continuous infusion of the 
10 active agent as ascertained by plasma levels. It should be noted that the 
attending physician would know how to and when to terminate, interrupt or 
adjust therapy to lower dosage due to toxicity, or bone marrow, liver or kidney 
dysfunctions. Conversely, the attending physician would also know how to and 
when to adjust treatment to higher levels if the clinical response is not adequate 
15 (precluding toxic side effects). 

The efficacy and/or toxicity of the pharmaceutical compositions provided 
herein, alone or in combination with other agents can also be assessed by the 
methods known in the art (See generally, O'Reilly, Investigational New Drugs, 
15:5-13 (1997)). 

20 The active compounds or pharmaceutical^ acceptable derivatives may be 

prepared with carriers that protect the compound against rapid elimination from 
the body, such as time release formulations or coatings. 

Kits containing the compositions and/or the combinations with 
instructions for administration thereof are provided. The kit may further include 

25 a needle or syringe, preferably packaged in sterile form, for injecting the 

complex, and/or a packaged alcohol pad. Instructions are optionally included for 
administration of the active agent by a clinician or by the patient. 

Finally, the pharmaceutical compositions provided herein containing any 
of the preceding agents may be packaged as articles of manufacture containing 

30 packaging material, a compound or suitable derivative thereof provided herein, 
which is effective for treatment of a diseases or disorders contemplated herein, 
within the packaging material, and a label that indicates that the compound or a 
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suitable derivative thereof is for treating the diseases or disorders contemplated 
herein. The label can optionally include the disorders for which the therapy is 
warranted. 

Also provided herein are personal care formulations containing oil bodies 
5 comprising a thioredoxin/thioredoxin-reductase fusion polypeptide. Personal care 
products comprising thioredoxin and thioredoxin-reductase are disclosed in for 
example Japanese Patent Applications JP9012471 A2, JP103743A2, and 
JP1 129785A2 Personal care formulations that may be prepared in accordance 
with the present invention include formulations capable of improving the physical 
10 appearance of skin exposed to detrimental environmental stimuli resulting in 
oxidative stress for example oxidative stress caused by UV-generated free- 
radicals. The oil bodies comprising thioredoxtn/thioredoxin-reductase may also 
be used to prepare hair care products as described in US Patent Nos. 4,935,231 
and 4,973,475 (incorporated herein by reference in their entirety). 
15 The following examples are included for illustrative purposes only and are 

not intended to limit the scope of the invention. 

EXAMPLE 1 

Isolation of thioredoxin and NADPH thioredoxin-reductase genes 

An Arabidopsis silique cDNA library CD4-1 2 was obtained from the 

20 Arabidopsis Biological Resource Centre (ABRC, http://aims.cps.msu.edu) 
Arabidopsis stock centre and used as a template for the isolation of the 
thioredoxin h (Trxh) and thioredoxin-reductase genes from Arabidopsis. For the 
isolation of the Trxh gene the following primers were synthesized: 
GVR833: 5' TA CC ATG G CTTCG G AAG A AGG A 3' (SEQ ID NO:1) 

25 The sequence identical to the 5' end of the Trxh gene as published in 

Rivera-Madrid et al, (1993) Plant Physiol 102: 327-328, is indicated in bold. 
Underlined is an Ncol restriction site to facilitate cloning. GVR834: 5' 
GA AAGCTTA AGCCAAGTGTTTG 3' (SEQ ID NO:2) 

The sequence complementary to the 3' end of the Trxh gene as published 

30 in Rivera-Madrid et al, (1993) Plant Physiol 102: 327-328, is indicated in bold. 
Underlined is an Hindlll restriction site to facilitate cloning. 
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A Polymerase Chain Reaction (PCR) was carried out using GVR833 and 
GVR834 as primers and the cDNA library CD4-12 as a template. The resulted 
PCR fragment was isolated, cloned into pBluescript and sequenced. The isolated 
sequence encoding Trxh was identical to the published Trxh gene sequence 
5 (Rivera-Madrid et al, (1993) Plant Physiol 102: 327-328). The pBluescript vector 
containing the Trxh gene is called pSBS2500. 

For the isolation of the thioredoxin-reductase gene the following primers 
were synthesized: 

GVR836: 5' GGCCAGCACACTACCATGAATGGTCTCGAAACTCAC 3' (SEQ ID 
10 NO:3). The sequence identical to the 5' end of the thioredoxin-reductase gene 
as published (Jacquot et al, J Mol Biol. (1994) 235 (4):1 357-63), is indicated in 
bold). 

GVR837: 5' TTAAGCTTCAATCACTCTTACCTTGCTG 3' (SEQ ID N0:4). 

A Polymerase Chain Reaction (PCR) was carried out using GVR836 and 
15 GVR837 as primers and the cDNA library CD4-12 as a template. The resulted 
PCR fragment was isolated, cloned into pBluescript and sequenced. The 
pBluescript vector containing the thioredoxin-reductase gene is called 
pSBS2502. 

A total of three clones were sequenced, the sequence of each of the 
20 three clones were identical to each other. However, as depicted in Figure 1 this 
sequence indicated several nucleotide differences compared to the published 
thioredoxin-reductase gene sequence published (Jacquot et al, J Mol Biol. (1994) 
235 (4):1 357-63.). The complete coding sequence and its deduced amino acid 
sequence is shown in SEQ ID NO: 10. As a result of the nucleotide differences 
25 between the published sequence and the sequence isolated in Example 1 , 
several amino acid changes are also predicted. A comparison of the deduced 
amino acid sequence of the published NADPH thioredoxin-reductase sequence 
thioredoxin-reductase (ATTHIREDB, Jacquot et al, J Mol Biol, (1994) 235 
(4):1 357-63.) with the sequence isolated in Example 1 (TR) is shown in Figure 3. 



30 
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EXAMPLE 2 
Construction of plant expression vectors. 

Expression vectors were constructed to allow for the seed specific over- 
expression of thioredoxin and NADPH thioredoxin-reductase in seeds. Vectors 
5 were constructed to allow for over-expression in its natural subcellular location 
and for accumulation on oil bodies. 

Construction of plant transformation vector pSBS2520. 
The Arabidopsis thioredoxin h gene as described in example 1 was placed under 
the regulatory control of the phaseolin promoter and the phaseolin terminator 

10 derived from the common bean Phaseolus vulgaris (Slightom et al (1983) Proc. 
Natl Acad Sc USA 80: 1897-1901; Sengupta-Gopalan eta/., (1985) PNAS USA 
82: 3320-3324)). A gene splicing by overlap extension technique (Horton et al 
(1989) 1 5: 61-68) was used to fuse the phaseolin promoter to the Trxh gene. 
Standard molecular biology laboratory techniques (see eg: Sambrook et al. ( 

15 (1990) Molecular Cloning, 2 nd ed. Cold Spring Harbor Press) were used to furnish 
the phaseolin promoter and terminator with Pst I and HindlH/Kpnl sites 
respectively (see SEQ ID NO:14). Standard molecular biology laboratory 
techniques were also used to place the phaseolin terminator downstream from 
the Trxh gene. The Pstl-phaseolin promoter- Trxh-phaseolin terminator-Kpnl 

20 insert sequence was cloned into the Pstl-Kpnl sites of pSBS3000 (pSBS3000 is 
a derivative from the Agrobacterium binary plasmid pPZP221 (Hajdukiewicz et 
al., 1994, Plant Molec. Biol. 25: 98*9-994). In pSBS3000, the CaMV35S 
promoter-gentamycin resistance gene-CAMV 35S terminator of pPZP221 was 
replaced with parsley ubiquitin promoter-phosphinothricin acetyl transferase 

25 gene-parsley ubiquitin termination sequence to confer resistance to the herbicide 
glufosinate ammonium.) The resulting plasmid is called pSBS2520. The 
sequence of the phaseolin promotet-Arabidopsis Trxh-phaseolin terminator 
sequence is shown in SEQ ID NO:14. 

Construction of plant transformation vector pSBS2510. 

30 The 3' coding sequence of an Arabidopsis oleosin gene (van Rooijen et al (1992) 
Plant Mol. Biol. 18: 1 177-1 179) was altered to contain an Ncol site. The Ncol- 
Hindlll fragment from v ctor pSBS2500 (Example 1) containing the Trxh was 
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ligated to the coding sequence of this Arabidopsis oleosin utilizing this Ncol 
restriction site. A gene splicing by overlap extension technique (Horton et al 

(1989) 15: 61-68) was used to fuse the phaseolin promoter (Slightom et af 
(1983) Proc. Natl Acad Sc USA 80: 1897-1901; Sengupta-Gopalan et <?/., 

5 (1985) PNAS USA 82: 3320-3324) containing a synthetic Pstl site (see 

construction of pSBS2520) to the coding sequence of the Arabidopsis oleosin. 
Standard molecular biology laboratory techniques (see eg: Sambrook et al. 

(1990) Molecular Cloning, 2 nd ed. Cold Spring Harbor Press) were again used to 
clone the Hindlll Kpnl fragment containing the phaseolin terminator (see 

10 construction of pSBS2520) downstream of the Trxh gene. The Pstl-phaseolin 
promoter- oleosin- Trxh-phaseolin terminator-Kpnl insert sequence was cloned 
into the Pstl-Kpnl sites of pSBS3000. The resulting plasmid is called pSBS2510. 
The sequence of the phaseolin promoter-oleosin Trxh-phaseolin terminator 
sequence is shown in SEQ ID NO:16. 

1 5 Construction of plant transformation vector pSBS252 1. 

This vector contains the same genetic elements as the insert of pSBS2510 
except the Trxh gene is fused to the 5' end of the oleosin gene.The 3' oleosin 
coding sequence including its native stopcodon (van Rooijen et al (1992) Plant 
Mol. Biol.18: 1 177-1179) was furnished with a Hindlll cloning site. Again a 

20 gene splicing by overlap extension technique (Horton et al (1989) 15: 61-68) 
was used to fuse the phaseolin promoter to the Trxh gene and to fuse the Trxh 
gene to the oleosin sequence. Standard molecular biology laboratory techniques 
(see eg: Sambrook et al. (1990) Molecular Cloning, 2 nd ed. Cold Spring Harbor 
Press) were again used to clone the Hindlll Kpnl fragment containing the 

25 phaseolin terminator (see construction of pSBS2520) downstream of the oleosin 
gene. The Pstl-phaseolin promoter- Trxh oleosin- phaseolin terminator-Kpnl 
insert sequence was cloned into the Pstl-Kpnl sites of pSBS3000. The resulting 
plasmid is called pSBS2521 . The sequence of the phaseolin promoter- Trxh 
oleosin -phaseolin terminator sequence is shown in SEQ ID NO: 19. 

30 Construction of plant transformation vector pSBS2527. 

The Arabidopsis NADPH thioredoxin-reductase gene as described in example 1 
was placed under the regulatory control of the phaseolin promoter and the 
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phaseolin terminator derived from the common bean Phaseolus vulgaris 
(Slightom et al (1983) Proc. Natl Acad Sc USA 80: 1897-1901; Sengupta- 
Gopalan etal., (1985) PNAS USA 82: 3320-3324). A gene splicing by overlap 
extension technique (Horton et al (1989) 15: 61-68) was used to fuse the 
5 phaseolin promoter to the thioredoxin-reductase gene. Standard molecular 

biology laboratory techniques (see eg: Sambrook etal. (1990) Molecular Cloning, 
2 nd ed. Cold Spring Harbor Press) were used to furnish the phaseolin promoter 
and terminator with Pstl and Hindlll/Kpnl sites respectively (see SEQ ID N0:14). 
Standard molecular biology laboratory techniques were also used to place the 

10 phaseolin terminator downstream from the thioredoxin-reductase gene. The Pstl- 
phaseolin promoter-thioredoxin-reductase-phaseolin terminator-Kpnl insert 
sequence was cloned into the Pstl-Kpnl sites of pSBS3000 The resulting plasmid 
is called pSBS2527. The sequence of the phaseolin promoter-Arabidopsis 
thioredoxin-reductase-phaseolin terminator sequence is shown in SEQ ID NO.-22. 

1 5 Construction of plant transformation vector pSBS2531. 

A gene splicing by overlap extension technique (Horton et al (1989) 15: 61-68) 
was used to fuse the phaseolin promoter (Slightom et al (1983) Proc. Natl Acad 
Sc USA 80: 1897-1901 ; Sengupta-Gopalan etaL, (1985) PNAS USA 82: 3320- 
3324) to the coding sequence of the Arabidopsis oleosin. The same gene 

20 splicing technique was used to fuse the oleosin gene to the thioredoxin- 
reductase coding sequence. Standard molecular biology laboratory techniques 
(see eg: Sambrook etal. (1990) Molecular Cloning, 2 nd ed. Cold Spring Harbor 
Press) were again used to clone the Hindlll Kpnl fragment containing the 
phaseolin downstream of the thioredoxin-reductase gene. The Pstl-phaseolin 

25 promoter- oleosin- thioredoxin-reductase -phaseolin terminator-Kpnl insert 
sequence was cloned into the Pstl-Kpnl sites of pSBS3000, The resulting 
plasmid is called pSBS2531. The sequence of the phaseolin promoter-oleosin 
thioredoxin-reductase -phaseolin terminator sequence is shown in SEQ ID NO:24. 
Construction of plant transformation vector pSBS2529 

30 This vector contains the same genetic elements as the insert of pSBS2531 

except the thioredoxin-reductase gene is fused to the 5' end of the oleosin gene. 
The 3' oleosin coding sequence including its native stopcodon (van Rooijen et al. 
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(1992) Plant Mol. Biol. 18: 1 177-1 179) was furnished with a Hindlll cloning site. 
Again a gene splicing by overlap extension technique (Horton et al (1989) 15: 
61-68) was used to fuse the phaseolin promoter to the thioredoxin-reductase 
gene and to fuse the thioredoxin-reductase gene to the oleosin sequence. 
5 Standard molecular biology laboratory techniques (see eg: Sambrook et al. 
(1990) Molecular Cloning, 2 nd ed. Cold Spring Harbor Press) were again used to 
clone the Hindlll Kpnl fragment containing the phaseolin terminator (see 
construction of pSBS2520) downstream of the oleosin gene. The Pstl-phaseolin 
promoter- thioredoxin-reductase oleosin- phaseolin terminator-Kpnl insert 
10 sequence was cloned into the Pstl-Kpnl sites of pSBS3000. The resulting 
plasmid is called pSBS2529. The sequence of the phaseolin promoter- 
thioredoxin-reductase oleosin -phaseolin terminator sequence is shown in SEQ ID 
NO:27. 

Construction of plant transformation vector pSBS2530. 

15 A plant transformation was constructed containing the Mycobacterium Leprae 
thioredoxin-reductase /thioredoxin gene (Mlep TR/Trxh). A construct called 
pHIS/TR/Trxh (Wieles et al (1995) J Biol Chem 270:25604-25606) was obtained 
from the department of Immunohematology and Blood bank, Leiden University, 
The Netherlands and use as a template for PCR to generate pSBS2530, The 

20 construction of pSBS2530 was identical to the construction of pSBS2531 
except that the Mlep TR/Trxh gene was used instead of the Arabidopsis 
thioredoxin-reductase gene. A gene splicing by overlap extension technique 
(Horton et al (1989) 15: 61-68) was used to fuse the phaseolin promoter 
(Slightom et al (1983) Proc. Natl Acad Sc USA 80: 1897-1901; Sengupta- 

25 Gopalan etaL, (1985) PNAS USA 82: 3320-3324) to the coding sequence of 
the Arabidopsis oleosin. The same gene splicing technique was used to fuse the 
oleosin gene to the Mlep TR/Trxh coding sequence. Standard molecular biology 
laboratory techniques (see eg: Sambrook etaL (1990) Molecular Cloning, 2 nd ed. 
Cold Spring Harbor Press) were again used to clone the Hindlll-Kpnl fragment 

30 containing the phaseolin downstream of the Mlep TR/T rxh gene. The Pstl- 
phaseolin promoter- oleosin- Mlep TR/Trxh -phaseolin terminator-Kpnl insert 
sequence was cloned into the Pstl-Kpnl sites of pSBS3000. The resulting 
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plasmid is called pSBS2530. The sequence of the phaseolin promoter-oleosin 
Mlep TR/Trxh -phaseolin terminator sequence is shown in SEQ ID NO:30. 

Construction of plant transformation vector pSBS2542. 
From initial activity assays (Figure 4), it was apparent that oil bodies expressing 
5 the oleosin-M. lep TR/Trxh fusion protein contained considerable reducing 

activity. It was anticipated that a similar oleosin fusion construct encoding the 
Arabidopsis thioredoxin-reductase and thioredoxin proteins would behave in an 
analogous manner. Molecular modeling was used to aid in the design of such a 
construct. Primers were designed (thioredoxin link-L: 5'- 
10 ■ ACTGGAGATGTTGACTCGACGGATACTACGGATTGGTCGACGG 

CTATGGAAGAAGGACAAGTGATCGCCTGC-3'; (SEQ ID NO:5), and thioredoxin 
link-R: 

B'-ATCCGTCGAGTCAACATCTCCAGTTTCCTCGGTGGTCTCGTTAGCCTTCGAT 
CC A G C A ATCTCTTGTA AG AATG CTCTG C-3 ' ; (SEQ ID NO:6) to code for a 

15 synthetic linker peptide between the thioredoxin-reductase and thioredoxin 
proteins. These primers were used in conjunction with primers GVR 873 (5'- 
GTGGAAGCT TATGGAGATGGAG-3'; SEQ ID NO:7) and GVR834 (5'- 
GAAAGCTTAAGCCAAGTGTTTG-3'; SEQ ID NO:2) to amplify a region coding 
for a thioredoxin-reductase-linker region-thioredoxin utilizing a gene splicing by 

20 overlap extension technique (Horton et al (1989) 15:61-68). The thioredoxin- 
reductase-linker-thioredoxin encoding sequence was then cloned into a pre- 
existing pSBS3000 vector using standard molecular biology techniques 
(Sambrook et al (1990) Molecular Cloning 2 nd Edition Cold Spring Harbour Press). 
The resulting plasmid was called pSBS2542. The sequence of the phaseolin 

25 promoter-oleosin-thioredoxin-reductase-linker-thioredoxin-phaseolin terminator 
region is shown in SEQ ID NO:33. An amino acid sequence comparison 
between this Arabidopsis thioredoxin-reductase-linker-thioredoxin and the M. 
leprae TR/Trxh protein is shown in Figure 1 2. 

Plasmids pSBS2510, pSBS2520 f pSBS2521, pSBS2527, pSBS2529, 

30 pSBS2530, pSBS2531 and pSBS2542 were electroporated into Agrobacterium 
strain EHA101 . These Agrobacterium strains were used to transform 
Arabidopsis. Arabidopsis transformation was done essentially as described in 
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"Arabidopsis Protocols; Methods in molecular biology Vol 82. Edited by 
Martinez-Zapater JM and Salinas J. ISBN 0-89603-391-0 pg 259-266 (1998) 
except the putative transgenic plants were selected on agarose plates containing 
80//M L-phosphinothricine, after they were transplanted to soil and allowed to 
5 set seed. 

EXAMPLE 3 

Polvacrvlamide qelelectrophoresis and Immunoblottina of transgenic seed 
extracts. 

10 Source of Arabidopsis thioredoxin, thioredoxin reductase and oleosin 

antibodies. 

The Arabidopsis thioredoxin and thioredoxin-reductase genes were cloned in 
frame in bacterial expression vector pRSETB {Invitrogen) to allow for the 
overexpression of Arabidopsis thioredoxin and thioredoxin-reductase proteins. 

15 These proteins were purified using standard protocols (see eg Invitrogen 
protocol) and used to raise antibodies in rabbits using standard biochemical 
techniques (See eg Current Protocols in Molecular Biology, John Wiley & Sons, 
N,Y, (1989). The Arabidopsis oleosin gene genes was cloned in frame in 
bacterial expression vector pRSETB (Invitrogen) to allow for the overexpression 

20 Arabidopsis oleosin protein. This protein was purified using standard protocols 
(see eg Invitrogen protocol) and used to prepare mouse monoclonal antibodies 
using standard biochemical techniques (See eg Current Protocols in Molecular 
Biology, John Wiley & Sons, N.Y. (1989). 

Preparation of total Arabidopsis seed extracts for PA GE. Arabidopsis 

25 seeds were ground in approximately 20 volumes of 2% SDS, 50 mM Tris-CI„ 
this extract was boiled, spun and the supernatant was prepared for 
polyacrylamide gelelectrophoresis (PAGE) using standard protocols. 
Preparation of Arabidopsis oil-body-protein extracts. 

Arabidopsis seeds were ground in approximately 20 volumes of water and spun 
30 in a microfuge. The oil bodies were recovered and washed sequentially with 
approximately 20 volumes of water, a high stringency wash buffer, containing 
8M urea and 100 mM sodiumcarbonate and water. After this last wash the 
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oil bodies are prepared for poly acrylamide gelelectrophoresis (PAGE) using 
standard protocols. 

Analysis of seed and oil body extracts from plants transformed with 
pSBS2510 

5 Total seed and oil body protein extracts from plants transformed with 
pSBS2510 were loaded onto polyacrylamide gels and either stained with 
coomassie brilliant blue or electroblotted onto PVDF membranes. The membranes 
were challenged with a polyclonal antibody raised against Arabidopsis 
thioredoxin, or a monoclonal antibody raised against the Arabidopsis 18.5 kDa 

10 oleosin and visualized using alkaline phosphatase. Expression of the oleosin- 
thioredoxin results in an additional band of 31.2 kDa. The results indicate that 
the thioredoxin antibodies are immunologically reactive with a band of the right 
predicted molecular weight (31.2 kDa), and the oleosin antibodies are also 
immunologically reactive with a band of the right predicted molecular weight for 

15 the fusion protein (31 .2 kDa) in addition to a band corresponding to the native 
Arabidopsis oleosin (18.5 kDa). This indicates that oleosin-thioredoxin is 
expressed in Arabidopsis seeds and is correctly targeted to oil bodies. 

Analysis of seed and oil body extracts from plants transformed with 
PSBS2521 

20 Total seed and oil body protein extracts from plants transformed with 

pSBS25121 were loaded onto polyacrylamide gels and either stained with 
Coomassie brilliant blue or electroblotted onto PVDF membranes. The 
membranes were challenged with a polyclonal antibody raised against 
Arabidopsis thioredoxin, or a monoclonal antibody raised against the Arabidopsis 

25 18,5 kDa oleosin and visualized using alkaline phosphatase. Expression of the 
thioredoxin-oleosin results in an additional band of 31.2 kDa. The results 
indicate that the thioredoxin antibodies are immunologically reactive with a band 
of the right predicted molecular weight (31.2 kDa), and the oleosin antibodies 
are also immunologically reactive with a band of the right predicted molecular 

30 weight for the fusion protein (31 .2 kDa) in addition to a band corresponding to 
the native Arabidopsis oleosin (18.5 kDa). This indicates that thioredoxin- 
oleosin is expressed in Arabidopsis seeds and is correctly targeted to oil bodies. 

RECTIFIED SHEET (RULE 91) 



WO 02/50289 



PCTVUS01/50240 



-88- 

Analysis of seed extracts from plants transformed with pSBS2520 Total 
seed extracts from plants transformed with pSBS2520 were loaded onto 
polyacrylamide gels and either stained with Coomassie brilliant blue or 
electroblotted onto PVDF membranes. The membranes were challenged with a 
5 polyclonal antibody raised against Arabidopsis thioredoxin and visualized using 
alkaline phosphatase. The results indicated that the thioredoxin antibodies are 
immunologically reactive with a band of approximately the right predicted 
molecular weight (12 kDa). Untransformed seeds do not show a detectable 
thioredoxin band. 

1 0 Analysis of seed and oil body extracts from plants transformed with 

pSBS2529 

Total seed and oilbody protein extracts from plants transformed with pSBS2529 
were loaded onto polyacrylamide gels and electroblotted onto PVDF membranes. 
The membranes were challenged with a polyclonal antibody raised against 

1 5 Arabidopsis thioredoxin-reductase, or a monoclonal antibody raised against the 
Arabidopsis 18.5 kDa oleosin and visualized using alkaline phosphatase. 
Expression of the thioredoxin-reductase -oleosin results in an additional band of 
53.8 kDa. The results indicate that the thioredoxin-reductase antibodies are 
immunologically reactive with a band of the right predicted molecular weight for 

20 the fusion protein (53.8 kDa), the oleosin antibodies are also immunologically 
reactive with a band of the right predicted molecular weight (53.8 kDa) in 
addition to a band corresponding to the native Arabidopsis oleosin (18.5 kDa). 
This indicates that thioredoxin-reductase-oleosin is expressed in Arabidopsis 
seeds. 

25 Analysis of seed extracts from plants transformed with pSBS2527 Total 

seed extracts from plants transformed with pSBS2527 were loaded onto 
polyacrylamide gels and electroblotted onto PVDF membranes. The membranes 
were challenged with a polyclonal antibody raised against Arabidopsis 
thioredoxin-reductase and visualized using alkaline phosphatase. The 

30 thioredoxin-reductase antibodies are immunologically reactive with a band of 
approximately the right predicted molecular weight for the (35.3 kDa). 
Untransformed seeds do not show a detectable thioredoxin band, 
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Analysis of seed extracts from plants transformed with pSBS2S37 A 
protein gel and immunoblot was prepared assaying the expression of oleosin- 
DMSR in Arabidopsis T2 seeds and correct targeting to Arabidopsis oil bodies. 
The expected molecular weight based on the deduced amino acid sequence is 
5 calculated to be 53,817 Da. In the oil body extract of the transgenic oleosin- 
thioredoxin-reductase sample an extra band of approximately 54 kDa was 
observed. This band was confirmed to be oleosin-thioredoxin-reductase by 
immunoblotting. From the polyacryiamide gel it was observed that the 
expression of the oleosin -Thioredoxin-reductase is about double compared to 

10 the expression of the major 18.5 kDa Arabidopsis oleosin. This represents 
approximately 2-4 % of total seed protein. 

Analysis of seed extracts from plants transformed with pSBS2530 A 
protein gel and immunoblot was prepared assaying the expression of oleosin- 
M.lep TR/Trxh in Arabidopsis 12 seeds and the correct targeting to Arabidopsis 

1 5 oil bodies. The expected molecular weight based on the deduced amino acid 
sequence is calculated to be 67,550 Da. In the oil body extract of the 
transgenic oleosin-M/ep TR/Trxh sample an extra band of approximately 68 kDa 
was observed. This band was confirmed to be oleosin-M./ep TR/Trxh by 
immunoblotting. From the polyacryiamide gel it was observed that the 

20 expression of the oleosin-ZW./ep TR/Trxh is similar to the expression of the major 
18.5 kDa Arabidopsis oleosin. This represents approximately 1-2 % of total 
seed protein. 

Analysis of seed extracts from plants transformed with pSBS2542 Crude 
oil body extracts from pSBS2542 lines were prepared by grinding 100pg of seed 

25 in 1 mL of lOOmM Tris buffer at pH 7.5, The samples were then centrifuged in 
order to isolate the oil body fraction. The oil body fraction was then loaded on 
an SDS polyacryiamide gel for expression analysis. A Coomassie stained gel 
revealed that the synthetic fusion accumulated to high levels in crude oil body 
extracts from 3 of the 4 lines tested. It was estimated that the fusion protein 

30 represented approximately 2-5% of total seed protein. Furthermore, Western 
blots utilizing either anti-thioredoxin or anti-thioredoxin-reductase antibodies 
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confirmed that the over expressed 70 kDa protein was indeed oleosin- 
thioredoxin-reductase-linker-thioredoxin. 

EXAMPLE 4 

5 Biological activity of thioredoxin and thioredoxin-reductase transformants 

Initial reduction assays: 
DTNB assay 

The activity of the thioredoxin and thioredoxin reductase was determined 
using a colorimetric DTNB [5,5'-dithiolbis (2-nitrobenzoic acid)] assay. The 

10 assay was performed in a 700 //L reaction volume containing 100mM Tris-CI pH 
8.0, 5 mM EDTA, 200//M DTNB [5,5'-dithiolbis (2-nitrobenzoic acid)] and 200//M 
NADPH. If thioredoxin-reductase and thioredoxin are added, NADPH will reduce 
the thioredoxin-reductase, which will then reduce thioredoxin, which will, in 
turn, reduce DTNB (see equations below). 

15 NADPH 2 + thioredoxin-reductase ox — > thioredoxin-reductase red + NADP+ 

thioredoxin-reductase^ + thioredoxin ox > thioredoxin rod + thioredoxin- 

reductase ox 

thioredoxin red + DTNB 0X > 2(2-nitro-5-mercaptobenzoic acid) + 

thioredoxin ox 

20 The formation of the yellow product was monitored by measuring the 

OD 412 in a spectrophotometer after a set period of time (usually 0.5-2 hours). 
The results of initial activity assays are shown in the bar graph in Figure 4 and 
described below. 

Initially, 1 00 jjg of total seed proteins were added from each of the 
25 Arabidopsis transgenic lines, pSBS2520 (cytosolic thioredoxin) and pSBS2527 
(cytosolic thioredoxin-reductase), which corresponds to approximately 1 jjg of 
cytosolic thioredoxin and thioredoxin-reductase used in the assay. In this case, 
the amount of DTNB reduced was comparable to the reduction caused by 1 jjg 
each of £ coli thioredoxin and thioredoxin-reductase. In these plant seed 
30 samples, background readings were very low when only one of the 2 extracts 
(either cytosolic thioredoxin or cytosolic thioredoxin-reductase; Figure 4, bars 3 
and 6, respectively) was added to the reaction, along with wild type oil bodies. 
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Analysis with oil body fractions from transgenic seeds revealed that 
Arabidopsis thioredoxin and thioredoxin-reductase were substantially less active 
when fused to oleosins on oil bodies. Approximately 300 pg of crude, 
unwashed oil-body-protein was used in the assay (which corresponds to 1 0- 
5 30jug of thioredoxin-oleosin (pSBS 2521; Figure 4, bar 2), oleosin-thioredoxin 
(pSBS 2510, Figure 4, bar 1), thioredoxin-reductase-oleosin (pSBS 2529, Figure 
4, bar 5), or oleosin-thioredoxin-reductase (pSBS 2531, Figure 4, bar 4). The oil- 
body-proteins were tested in conjunction with 1 00jt/g of total seed protein 
containing approximately 1//g of cytosolic thioredoxin (pSBS 2520) or 

10 thioredoxin-reductase (pSBS 2527). 

In such assays, pSBS2529 (thioredoxin-reductase-oleosin) and pSBS2531 
(oleosin-thioredoxin-reductase) do contain reductase activity when combined 
with cytosolic thioredoxin from pSBS2520 (see Figure 4, bars 7 and 8, 
respectively). Experiments estimated that the reductase activity of oleosin- 

15 thioredoxin-reductase was about 10-15% that of the cytosolic thioredoxin- 
reductase. The addition of tween at a final concentration of 0.4% could 
enhance this activity 2 or 3 fold. Interestingly, oleosin-thioredoxin-reductase 
(pSBS 2531) appears to be capable of reducing DTNB in the absence of added 
thioredoxin, although added thioredoxin causes significantly more DTNB 

20 reduction {see Figure 4; compare bar 4 W.T. + oleosin-thioredoxin-reductase to 
bar 7 thioredoxin + oleosin-thioredoxin-reductase). Experiments with pSBS2521 
(thioredoxin-oleosin) orpSBS2510 (oleosin-thioredoxin) combined with cytosolic 
thioredoxin-reductase from pSBS2527 (see Figure 4, bars 10 and 11, 
respectively) indicate that thioredoxin activity of these fusions is undetectable at 

25 these concentrations. 

Oil bodies from the transgenic Arabidopsis line, pSBS2530 (o\eos\n-MJep 
TR/Trxh) contain significant thioredoxin/thioredoxin-reductase activity (see Figure 
4, bar 12). One hundred micrograms of crude oil body protein for pSBS2530 
was tested (corresponding to approximately 5//g of oleosin- MJep TR/trxh 

30 fusion) in the assay. Based on the assay, it was estimated that this fusion is 
about 25-40% as active as cytosolic Arabidopsis thioredoxin and thioredoxin- 
reductase (Figure 4, bar 9) when comparing specific activity. 
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Insulin reduction assay 

The results from the DTNB assays were confirmed with insulin reduction 
assays. This assay contained insulin at a final concentration of 1mg/mL in 
100mM KH 2 P0 4 pH 7.0 + 5mM EDTA. In the presence of NADPH (500//M), 
5 thioredoxin, and thioredoxin-reductase, insulin is reduced and precipitates from 
the solution. Normally, insulin reduction is followed by measuring turbidity at 
OD 650. Alternatively, one can measure the conversion of NADPH 2 to NADP + 
by monitoring the decrease in absorbance at 340 nm. 

Both of the assays are difficult to measure when oil bodies are present, 

10 due to interference with the spectrophotometer readings. However, qualitative 
data could be obtained by centrifuging the tubes after a set period of time, and 
determining if an insulin pellet was present (oil bodies float to the top, while the 
insulin precipitate pellets out). Alternatively, samples could be filtered after a set 
period of time, and the change in absorbance at 340 nm could be measured. As 

15 mentioned previously, the results of the insulin reduction assays agreed with 
those of the DTNB assay, with the exception of the observation that pSBS2531 
(oleosin-thioredoxin-reductase) only reduced insulin in the presence of free 
thioredoxin from pSBS2520. 

Assays on seeds from Arabidopsis crosses that co-express oleosin- 

20 thioredoxin and oleosin-thioredoxin-reductase. 

Based upon initial DTNB and insulin reduction assays, it was apparent that 
mixing oil bodies from oleosin<-> thioredoxin and oleosin <->thioredoxin- 
reductase transgenic seeds resulted in very limited reducing activity (Note: the 
<-> indicates both configurations of oleosin fusions; ie. oleosin<-> thioredoxin 

25 would represent oleosin-thioredoxin and thioredoxin-oleosin fusions). 

To determine whether having oleosin <-> thioredoxin and oleosin <- 
>thioredoxin-reductase proteins present on the same oil body would have a 
positive effect on the reducing activity of these proteins, crosses were set up to 
generate double transgenic Arabidopsis lines. The crosses are illustrated in 

30 Table 2. 
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TABLE 2 



15 



Male 




Female 


Confirmed double 
transgenic lines 
(PCR and Western 
Blot) 


oleo-thioredoxin 


X 


oleo-thioredoxin-reductase 


4 


oleo-thioredoxin 


X 


thioredoxin-reductase-oleo 


1 


thioredoxin-oleo 


X 


oleo-thioredoxin-reductase 


0 


thioredoxin-oleo 


X 


thioredoxin-reductase-oleo 


4 


oleo-thioredoxin- 
reductase 


X 


oleo-thioredoxin 


2 


oleo-thioredoxin- 
reductase 


X 


thioredoxin-oleo 


0 


thioredoxin- 
reductase-oleo 


X 


oleo-thioredoxin 


7 


thioredoxin- 
reductase-oleo 


X 


thioredoxin-oleo 


0 



Seeds from Arabidopsis crosses were germinated on PPT plates and the 
seedlings were transferred to soil after approximately 2 weeks. PCR 
experiments on DNA isolated from the seedlings identified a number of plants 

20 which contain both an oleosin<->thioredoxin and an oleosin<->thioredoxin- 
reductase gene construct within their genome. 

Seeds were harvested from these plants for expression and activity 
assays. Western blots were carried out to confirm expression of both oleosin<- 
>thioredoxin and oleosin<->thioredoxin-reductase in the lines. DTNB and 

25 insulin reduction assays were also performed to compare activity between single 
transgenic parent lines and the double transgenic offspring and results are 
summarized in Table 3. Table 3 summarizes DTNB reducing activity of various 
transgenic lines. The last 2 rows compare mixing oil bodies from single 
transgenic parent lines to using oil bodies from double transgenic offspring. 

30 Relative activity for the £ coli thioredoxin and thioredoxin mixture is set at 100 
percent. 



WO 02/50289 



PCT/US01/50240 



-94- 



TABLE 3 



Source Material 


Relative Activity (%) 


E.coli trx + NTR 


100 


Arab/doosfc "free" thioredoxin 4- 
thioredoxin-reductase 
(pSBS2520 + pSBS2527) 


100 


oleosin- M. lep TR/Trxh 
(pSBS2530) 


-30 


Oleosin < - > thioredoxin-reductase + 
oleosin <-> thioredoxin (mixing oil bodies 
from single-transgenic parents) 


-3 


Oleosin < - > thioredoxin-reductase X 
oleosin < - > thioredoxin 
(various double transgenic lines) 


-50 J 



15 

Based on DTNB and insulin reduction assays, it is evident that double 

c 

transgenic plants co-expressing oleosin <-> thioredoxin and oleosin <- 
> thioredoxin-reductase on the same, single oil body contained significantly more 
reducing activity compared to mixing oil bodies from single transgenic oleosin <- 
20 >thioredoxin and oleosin <-> thioredoxin-reductase lines. It was additionally 

apparent that oil body extracts from co-expressing lines contained more reducing 
activity compared to line pSBS2530 (oleosin-M. lep TR/Trxh), which was 
previously identified as the line containing the highest reducing activity from oil 
bodies. 

25 These results suggest that the creation of double transgenic lines (either 

through crossing or by co-transforming 2 expression constructs into plants) may 
represent one means by which we could solve our initial problem of not being 
able to generate reducing activity by mixing oil bodies from oleosin <- 
>thioredoxin and oleosin <-> thioredoxin-reductase single transgenic lines. 

30 Assays on seeds from Arabidopsis pSBS2542 transgenic lines that 

express oleosin-thioredoxin-reductase-linker-thioredoxin. 

Oil body extracts from four pSBS2542 lines were tested for reducing activity in 
DTNB and insulin reduction assays, using standard protocols described 
previously. Again, oil body extracts containing the oleosin-thioredoxin- 
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reductase-linker-thioredoxin protein possessed significant reducing activity. 
Based on such assays, it was revealed that the oleosin-thioredoxin-reductase- 
linker-thioredoxin synthetic fusion protein was more active than the oleosin-M 
lep TR/Trxh fusion. Furthermore, oil bodies containing the oleosin-thioredoxin- 
5 reductase-linker-thioredoxin protein appeared to have more reducing activity 
compared to oil bodies from double transgenic lines that co-expressed oleosin <- 
>thioredoxin and oleosin<->thioredoxin-reductase. The results comparing 
reducing activity for the various thioredoxin-reductase/thioredoxin constructs is 
summarized in Table 4. Table 4 summarizes DTIMB reducing activity of various 

10 transgenic lines. The pSBS2542 line expressing oleosin-thioredoxin-reductase- 
linker-thioredoxin contains significant reducing activity, comparable to the "free" 
forms of Arabidopsis thioredoxin and thioredoxin-reductase and the equivalent £. 
coli proteins. Relative activity for the £. coli thioredoxin and thioredoxin mixture 
is set at 1 00 percent. 

15 TABLE 4 



Source Material 


Relative Activity (%) 


Eco/i trx + NTR 


100 


Arabidopsis "free" thioredoxin + thioredoxin- 
reductase 
(PSBS2520 + pSBS2527) 


100 


oleosin- M. lep TR/Trxh 
(pSBS2530) 


-30 


Oleosin < - > thioredoxin-reductase + 
oleosin < - > thioredoxin 
(mixing oil bodies from single-transgenic parents) 




Oleosin < - > thioredoxin-reductase X 
oleosin < - > thioredoxin 
(various double transgenic lines) 


-50 


Oleosin-thioredoxin-reductase-linker-thioredoxin 
(PSBS2542) 


-75-100 
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Reduction assays comparing the utilization of NADH vs. NADPH as a 
cof actor (electron donor) for the thioredoxin-reductase/thioredoxin system. 
DTNB and insulin reduction assays were conducted as described previously, 
except that NADH was substituted for NADPH as an electron donor in the 
5 system utilizing E. coli thioredoxin-reductase and thioredoxin. Thus, a 
comparison was conducted of the utilization of NADH versus NADPH as a 
cofactor for the £. coli thioredoxin-reductase/ thioredoxin system. For the DTNB 
assay, the reaction mixture consisted of 400 //M DTNB, 1 0 //g/mL £ coli 
thioredoxin, and 10/ig/mLE coli thioredoxin-reductase in 100mM Tris-CI buffer 
10 pH 8.0. Either NADH or NADPH was then added to the DTNB reaction as 
follows: 

Reaction A. 200 //M NADPH (Sigma) 
Reaction B. 800 jjM NADH (Sigma) 
Reaction C. 800 //M NADH (Roche) 
15 Reaction D. (-) cofactor 

Reaction E. 800 //M NADH (no TR or Trxh). 

For the insulin reduction assay, the reaction mixture consisted of 1 mg/mL 
bovine pancreatic insulin, 20 //g/mL £. coli thioredoxin, and 20 //g/mL E. coli 
20 thioredoxin-reductase in 1 0OmM potassium phosphate buffer at pH 7.0. Either 
NADH or NADPH was then added to the reaction as follows: 
Reaction A. 800 //M NADPH (Sigma) 
Reaction B. 800 //M NADH (Sigma) 
Reaction C. 800 //M NADH (Roche) 
25 Reaction D. (-) cofactor 

Reaction E. 2 mM NADH (no TR or Trxh). 



The results indicate that NADH, purchased from either Sigma or Roche, could 
act as an electron donor in both the DTNB and insulin reduction assays. 
30 However, the rate of reduction was lower than the rate observed with NADPH 
as a cofactor. It was estimated that the rate of insulin reduction utilizing NADH 
as an electron donor was approximately 25-50% when compared to the 
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maximum rate using NADPH. Furthermore, it was estimated that the rate of 
DTNB reduction utilizing NADH as an electron donor was approximately 5-10% 
of the maximum rate using NADPH. Similar results were observed using the 
oleosin-thioredoxin-reductase-linker thioredoxin fusion protein on Arabidopsis oil 
5 bodies instead of the £ coli thioredoxin-reductase and thioredoxin. 

Example 5 

Production of multimeric immunoglobulin protein in plant seed cells and capture 
on oil bodies using Protein A - oleosin fusion proteins. 

10 1 - Production of multimeric immunoglobulin protein in plant seed cells 

For expression of multimeric-protein-complexes containing multimeric- 
immunoglobulin-complexes, the cDNA sequences encoding individual light and 
heavy chains can be isolated from; 1) cell lines expressing a particular antibody, 
such as clonal B cell lines, or a hybridoma cell line, or 2) may be a recombinant 

15 antibody, assembled by combining select light and heavy chain variable domains 
and available light and heavy chain constant domain sequences, respectively. 
Variable domains with specific binding properties may be isolated from screening 
populations of such sequences, usually in the form of a single-chain Fv phage 
display library. 

20 Starting from known nucleic acid sequences and a source of light and 

heavy chains, the mature polypeptide coding sequences of each chain is isolated 
with a secretion signal sequence. The signal sequence can be the native 
antibody sequence or derived from a known secreted plant sequence {e.g. a PR 
sequence from Arabidopsis or tobacco). The addition of a plant secretion signal 

25 sequence to both light and heavy chain mature coding sequences is carried out 
by standard molecular biology techniques. PCR fusion is used routinely to make 
such modifications. Secretion signal sequences are included to target the light 
and heavy immunoglobulin polypeptides for secretion from the cell and further 
assembly of the two chains into a multimeric-immunoglobulin-complex. For 

30 expression in transgenic plant seeds, an expression cassette is assembled 
comprising: 1) a regulatory promoter sequence to provide expression in plant 
seeds, 2) the secretion signal - light chain sequence, and 3) a regulatory 
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sequence to terminate transcription. A second expression cassette is assembled 
comprising: 1) a regulatory promoter sequence to provide expression in plant 
seeds, 2) the secretion signal - heavy chain sequence, and 3) a regulatory 
sequence to terminate transcription. Each of the antibody chain expression 
5 cassettes is cloned individually into an Agrobacterium plant transformation 
vector or is combined into a single transformation vector with both expression 
cassettes. In both cases, the expression cassettes are cloned into plant 
transformation vectors, between the left and right delineating border sequences, 
and adjacent to a plant selectable marker cassette. Each plant transformation 

10 vector is transformed into Agrobacterium. The resulting Agrobacterium strains 
are used to infect plant tissues. Transgenic plant material is regenerated and 
viable transgenic plants are selected. When individual transformation vectors are 
used, the transgenic plant lines that are produced, expressing either light or 
heavy chain sequences, are crossed to generate a single plant line expressing 

15 both chains in the same plant cell. When a single transformation vector, 
containing both light and heavy expression cassettes, is used, the initial 
transgenic plant line produces both light and heavy chain sequences in the same 
plant cell. 

2 - Production of transgenic oil bodies which display Protein A for the capture of 
20 immunoglobulins 

To capture and display immunoglobulin protein on oil bodies, oil bodies 
are engineered to display an immunoglobulin binding protein. In this example, the 
well-known antibody-binding domains from Protein A are used. Based on the 
known sequence for Protein A from Staphylococcus aureus, PCR primers are 

25 designed to isolate the five consecutive Ig-binding domains from the bacterial 
Protein A sequence. Primers are designed to allow cloning of the Protein A 
sequence as either an N-terminal or C-terminal fusion to an oleosin sequence for 
targeting to oil bodies. The sequence that encodes an in-frame translational 
fusion between Protein A and oleosin is cloned into a plant expression cassette 

30 for seed-specific expression. The final cassette consists of a regulatory 

promoter sequence that provides expression in seeds, the Protein A - oleosin 
fusion sequence, and a regulatory sequence to terminate transcription. The 
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Protein A - oleosin expression cassette is cloned into a plant transformation 
vector compatible with Agrobacterium - mediated plant transformation. The 
transformation vector comprises left and right border sequences flanking the 
Protein A - oleosin expression cassette and an adjacent plant selectable marker 
5 cassette. The Agrobacterium strain containing this vector is used to infect plant 
tissues and subsequent regeneration and selection from transgenic plant material 
to create transgenic plants. 

3 - Capture and display of multimeric-immunoglobulins on oil bodies displaying 
Protein A 

10 Having produced light and heavy chain multimeric immunoglobulin 

complexes in one transgenic plant line and the display of Protein A on oil bodies 
through the oil body targeting of a Protein A - oleosin fusion protein in a second 
plant line, at least two embodiments can be used to capture the immunoglobulin 
complex on the Protein A oil bodies. In the first embodiment, transgenic seed 

15 from both the immunoglobulin and the Protein A - oleosin expression lines is 
combined in an optimum ratio and then ground together such that the disrupted 
material, from both seed lines would be combined in the same extract. The 
combined seed extracts are mixed and/or incubated under conditions that allow 
maximum recovery of the immunoglobulin by Protein A. The oil body fraction is 

20 separated using standard phase separation techniques (e.g. centrifugation). The 
recovered oil body fraction contains both native oil bodies, from the 
immunoglobulin expression line, and transgenic Protein A oil bodies from the 
Protein A - oleosin expression line. 

In a second embodiment, the plant lines expressing the immunoglobulin 

25 complex and the Protein A - oleosin fusion are crossed and individual plant lines 
expressing both components are identified and propagated. In this approach, the 
immunoglobulin complex and the Protein A - oleosin fusion are produced in 
different cellular compartments of the same plant seed cell. Seed from the 
double transgenic line is ground to disrupt the cellular material and mix the 

30 contents of all cellular compartments, including combining the immunoglobulin in 
the extracellular compartment and the Protein A - oleosin on the oil body in the 
cytosolic compartment. The material is mixed and/or incubated under conditions 
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to allow maximum recovery of the immunoglobulin by Protein A f and the oil body 
fraction is separated by phase separation techniques. The recovered oil body 
fraction contains the displayed Protein A and the capture immunoglobulin 
complex. 

5 Example 6 

Production of assembled multimeric-immunoglobulin-complexes as fusions with 
oil body targeting domains. 

Individual polypeptides are produced as a fusion protein with oil body 
targeting sequences (e.g. oleosin) for display on oil bodies. It has been found 

10 that the individual subunits of naturally associating heterodimeric proteins can be 
co-produced as individual oleosin fusions and still associate as an active 
heterodimer on the surface of the oil body. In this example, the heterodimer is 
the light and heavy chain subunits, or derived portions thereof, of an 
immunoglobulin complex. 

1 5 Production of an immunoglobulin Fab complex on oil bodies. 

The mature light chain sequence, lacking the secretion signal sequence, is 
attached as an in-frame N-terminal fusion to an oleosin sequence. This fusion 
sequence is assembled into a seed-specific expression cassette consisting of a 
seed-specific promoter sequence, the light chain - oleosin fusion sequence, and 

20 a transcriptional terminator sequence. The expression cassette is inserted 

between the left and right border markers, adjacent to a plant selectable marker 
cassette, of a transformation vector. The transformation vector, in 
Agrobacterium, is used to infect plants and generate transgenic plants. 

An equivalent construct for the heavy chain subunit, comprising the 

25 variable and constant heavy chain domains, is also attached as an in-frame 
fusion to oleosin and assembled into an expression cassette for seed-specific 
expression. The expression cassette can be a part of a separate transformation 
vector for the generation of a separate transgenic line, or the heavy chain 
expression cassette can be combined together with the light chain cassette into 

30 a single transformation vector. If light and heavy chain expression cassettes are 
transformed into plants on separate transformation vectors, the individual plant 
lines are crossed to create a single line expressing both heterodimer subunit - 
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oleosin fusions in the same plant cell. Seed from the double transgenic line, or a 
single transgenic line generated from the dual expression vector, is extracted to 
isolate oil bodies. The seed material is ground to release the cellular contents 
and oil bodies are isolated by phase separation. The targeting of both light and 
5 heavy chain sequence to oil bodies, as oleosin fusions, allows the association of 
the immunoglobulin complex on the surface of the oil body. 

Similar configurations, using the entire heavy chain sequence in 
combination with the entire light chain sequence, or using the variable domains 
from both the light and heavy chain sequences, are constructed to assemble 

10 different types of heteromultimeric-immunoglobulin-complexes (e.g., 
heterodimers) on the surface of oil bodies. 

The present invention should therefore not be seen as limited to the 
particular embodiments described herein, but rather, it should be understood that 
the present invention has wide applicability with respect to protein expression 

15 generally. Since modifications will be apparent to those of skill in this art, it is 
intended that this invention be limited only by the scope of the appended claims. 
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SUMMARY OF SEQUENCES 

SEQ ID NOs:1-4 set forth primers which were synthesized for the 
isolation of the thioredoxin h (Trxh) and thioredoxin reductase genes from 
Arabidopsis, as described in Example 1 . 
5 SEQ ID NOs:5-7 set forth primers which were designed to code for a 

specific linker peptide between thioredoxin reductase and thioredoxin proteins, 
as described in Example 2. 

SEQ ID NOs:8, 10 and 1 1 set forth the nucleotide sequence and the 
deduced amino acid sequence of the NADPH thioredoxin reductase sequence 
10 isolated herein as described in Example 1. 

SEQ ID NOs:9 and 11, respectively, set forth the nucleotide sequence of 
the published NADPH thioredoxin reductase sequence (ATTHIREDB) and the 
deduced amino acid sequence* 

SEQ ID NO:12 sets forth the deduced amino acid sequence of the 
1 5 published NADPH thioredoxin reductase sequence. 

SEQ ID NO: 13 sets forth the deduced amino acid sequence of the NADPH 
reductase sequence isolated in this report. 

SEQ ID NOs:14 and 15 set forth the nucleotide sequence of the phaseolin 
promoter- Arabidopsis Trxh-phaseolin terminator sequence described in Example 
20 2, and the deduced amino acid sequence. The Trxh coding sequence and its 
deduced amino acid sequence is indicated. The phaseolin promoter corresponds 
to nucleotide 6-1 554, and the phaseolin terminator corresponds to nucleotide 
sequence 1905-3124. The promoter was furnished with a Psti site (nt 1-6) and 
the terminator was furnished with a Hindlll site (nt 1898-1903) and a Kpnl site 
25 (nt 31 24-31 29) to facilitate cloning. 

SEQ ID NOs:16, 17 and 18 set forth the nucleotide sequence of the 
phaseolin promoter-oleosin Trxh-phaseolin terminator sequence described in 
Example 2, and the deduced amino acid sequences. The oleosin-Trxh coding 
sequence and the deduced amino acid sequences are indicated in SEQ ID NO:16. 
30 As in SEQ ID NO:14, the phaseolin promoter corresponds to nucleotide 6-1554. 
The sequence encoding oleosin corresponds to nt 1555-2313, the intron in this 
sequence (nt 1 908-2147) is indicated in italics. The Trxh coding sequence 
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corresponds to nt 2314-2658. The phaseolin terminator corresponds to 
nucleotide sequence 2664-3884. 

SEQ ID NO: 19, 20 and 21 set forth the nucleotide sequence of the 
phaseolin promoter - Trxh oleosin-phaseolin terminator sequence as described in 
5 Example 2, and the deduced amino acid sequences. The Trxh oleosin- coding 
sequence and its deduced amino acid sequences are indicated in SEQ ID NO:19. 
As in SEQ ID N0s:14 and 16, the phaseolin promoter corresponds to nucleotide 
6-1554. The Trxh coding sequence corresponds to nt 1555-1896. The 
sequence encoding oleosin corresponds to nt 1897-2658, the intron in this 

10 sequence (nt 2250-2489) is indicated in italics. The phaseolin terminator 
corresponds to nucleotide sequence 2664-3884. 

SEQ ID NO:22 and 23 set forth the nucleotide sequence of the phaseolin 
promoter-thioredoxin-reductase-phaseolin terminator sequence as described in 
Example 2, and the deduced amino acid sequence. The thioredoxin-reductase 

15 coding sequence and its deduced amino acid sequence is indicated in SEQ ID 
NO:22. The phaseolin promoter corresponds to nucleotide 6-1554. The 
thioredoxin-reductase coding sequence corresponds to nt 1555-2556 and the 
deduced amino acid is set forth in SEQ ID NO:23. The phaseolin terminator 
corresponds to nucleotide sequence 2563-3782. 

20 SEQ ID NOs:24, 25 and 26 show the nucleotide sequence of the 

phaseolin promoter-oleosin thioredoxin-reductase-phaseolin terminator sequence 
as described in Example 2, and the deduced amino acid sequences. The oleosin- 
thioredoxin-reductase coding sequence and its deduced amino acid sequence is 
indicated. The phaseolin promoter corresponds to nucleotide 6-1554. The 

25 sequence encoding oleosin corresponds to nt 1555-2313, the intron in this 
sequence (nt 1980-2147) is indicated in italics. The thioredoxin-reductase 
coding sequence corresponds to nt 2314-3315. The phaseolin terminator 
corresponds to nucleotide sequence 3321-4540. 

SEQ ID NOs:27, 28 and 29 show the nucleotide sequence of the 

30 phaseolin promoter - thioredoxin-reductase oleosin - phaseolin terminator 

sequence as described in Example 2, and the deduced amino acid sequences. 
The thioredoxin-reductase coding sequence and its deduced amino acid 
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sequence is indicated. The phaseolin promoter corresponds to nucleotide 6- 
1554. The thioredoxin-reductase coding sequence corresponds to nt 1555- 
2553. The sequence encoding oleosin corresponds to nt 2554-3315, the intron 
in this sequence (nt 2751-3146) is indicated in italics. The phaseolin terminator 
5 corresponds to nucleotide sequence 3321-4540. 

SEQ ID NO:30, 31 and 32 show the sequence of the phaseolin promoter - 
oleosin - Mlep thioredoxin-reductase/thioredoxin -phaseolin terminator sequence 
as described in Example 2, and the deduced amino acid sequences. The oleosin- 
Mlep thioredoxin-reductase/thioredoxin coding sequence and its deduced amino 

10 acid sequence is indicated. The phaseolin promoter corresponds to nucleotide 6- 
1554. The sequence encoding oleosin corresponds to nt 1555-2313, the intron 
in this sequence (nt) is indicated in italics. The Mlep thioredoxin- 
reductase/thioredoxin coding sequence corresponds to nt 2314-3690. The 
phaseolin terminator corresponds to nucleotide sequence 3698-4917. 

15 SEQ ID NOs:33, 34 and 35 set forth the nucleotide sequence of the 

phaseolin promoter-oleosin-thioredoxin-reductase-linker-thioredoxin-phaseolin 
terminator region of pSBS2542, and the deduced amino acid sequences. The 
deduced amino acid sequence of oleosin-thioredoxin-reductase-linker-thioredoxin 
is also shown in SEQ ID NO:33. Amino acids representing oleosin are set forth 

20 at positions 1-173, those amino acids representing thioredoxin-reductase are set 
forth at positions 174-501, those amino acids representing the linker or spacer 
peptide are set forth at positions 501-524, and those representing thioredoxin 
are set forth at positions 525-636. 

SEQ ID NOs:38 and 39 set forth the nucleotide sequence of Arabidopsis 

25 Thaliana Thioredoxin h (Trx h 1 ) and the encoded protein, respectively. 

SEQ ID N0s:40 and 41 set forth the nucleotide sequence of Arabidopsis 
Thaliana Thioredoxin Reductase (NTR1) and the encoded protein, respectively. 

SEQ ID NOs:42 and 43 set forth the nucleotide sequence of E. Coli 
Thioredoxin (TrxA) and the encoded protein, respectively. 

30 SEQ ID N0s:44 and 45, set forth the nucleotide sequence of E. Coli 

Thioredoxin Reductase and the encoded protein, respectively. 
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SEQ ID NOs:46 and 47 set forth the nucleotide sequence of Human 
Thioredoxin and the encoded protein, respectively. 

SEQ ID NOs:48 and 49, set forth the nucleotide sequence of Human 
Thioredoxin Reductase and the encoded protein, respectively, 
5 SEQ ID NOs:50 and 51, respectively, set forth the nucleotide sequence of 

Mycobacterium leprae Thioredoxin-Thioredoxin Reductase and the encoded 
protein, respectively. 

SEQ ID NOs:52-313 are described in Table 5. 



TABLE 5 

10 



SEQ. ID 
NO. 


SWISS PROTEIN IDENTIFIER 
(in parenthesis) 


EXAMPLES OF REDOX PROTEINS 


PLANT THIOREDOXINS 


Thioredoxin f-type 


52 


(Q9XFH8) Thioredoxin F-type 1 , chloroplast precursor (TRX- 
F1). - Arabidopsis thaliana (Mouse-ear cress) Jj 


53 


(Q9XFH9) Thioredoxin F-type 2, chloroplast precursor (TRX- 
F2). {GENE: AT5G16400 OR MQK4.13) - Arabidopsis thaliana 
(Mouse-ear cress) 


54 


(048897) Thioredoxin F-type, chloroplast precursor (TRX-F). 
{GENE: TRXF} - Brassica napus (Rape) 


55 


(081332) Thioredoxin F-type, chloroplast precursor (TRX-F). - 
Mesembryanthemum crystallinum (Common ice plant) 


56 


(P29450) Thioredoxin F-type, chloroplast precursor (TRX-F). - 
Pisum sativum (Garden pea) 


57 


(P09856) Thioredoxin F-type, chloroplast precursor (TRX-F). - 
Spinacia oleracea (Spinach) 


Thioredoxin m-type 


58 


(P06544) Thioredoxin 1 (TRX-1) (Thioredoxin M). {GENE: 
TRXA} - Anabaena sp. (strain PCC 7119) 


59 


(048737) Thioredoxin M-type 1 , chloroplast precursor (TRX- 
M1). {GENE: AT1G03680 OR F21B7_7 OR F21B7.28} - 
Arabidopsis thaliana (Mouse-ear cress) 
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SEQ. ID 
NO. 


SWISS PROTEIN IDENTIFIER 
(in parenthesis) 


EXAMPLES OF REDOX PROTEINS 


60 


(Q9SEU8) Thioredoxin M-type 2 f chloroplast precursor (TRX- 
M2). {GENE: AT4G03520 OR F9H3.15 OR T5L23.1} - 
Arabidopsis thaliana (Mouse-ear cress) 


61 


(Q9SEU7) Thioredoxin M-type 3, chloroplast precursor (TRX- 
M3). {GENE: AT2G15570 OR F9013.12} - Arabidopsis 
thaliana (Mouse-ear cress) 


62 


(Q9SEU6) Thioredoxin M-type 4, chloroplast precursor (TRX- 
M4). - Arabidopsis thaliana (Mouse-ear cress) 


63 


(Q9XGS0) Thioredoxin M-type, chloroplast precursor (TRX-M). 
- Brassica napus (Rape) 


64 


(P23400) Thioredoxin M-type, chloroplast precursor (TRX-M) 
(Thioredoxin CH2). {GENE: TRXM} - Chlamydomonas 
reinhardtii 


65 


(Q41864) Thioredoxin M-type, chloroplast precursor (TRX-M). 
{GENE: TRM1} - Zea mays (Maize) 


66 


(Q9ZP20) Thioredoxin M-type, chloroplast precursor (TRX-M). 
- Oryza sativa (Rice) 


67 


(P48384) Thioredoxin M-type, chloroplast precursor (TRX-M). - 
Pisum sativum (Garden pea) 


68 


(P07591) Thioredoxin M-type, chloroplast precursor (TRX-M). - 
Spinacia oleracea (Spinach) 


69 


(Q9ZP21) Thioredoxin M-type, chloroplast precursor (TRX-M). 
- Triticum aestivum (Wheat) 


70 


(P1 2243) Thioredoxin 1 (TRX-1) (Thioredoxin M). {GENE: 
TRXA OR TRXM} - Synechococcus sp. (strain PCC 7942) 
(Anacystis nidulans R2) 


71 


(P37395) Thioredoxin. {GENE: TRXA OR TRX} - Cyanidium 
caldarium [Chloroplast] 


72 


(022022) Thioredoxin. {GENE: TRXA OR TRXM} - 
Cyanidioschyzon merolae [Chloroplast] 


73 


(P50338) Thioredoxin. {GENE: TRXA} - Griffithsia pacifica 
[Chloroplast] 
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I SEQ. ID 
NO.. 


SWISS PROTEIN IDENTIFIER 
(in parenthesis) 


EXAMPLES OF REDOX PROTEINS 


74 


(P50254) Thioredoxin. {GENE: TRXA} - Porphyra yezoensis 
[Chloroplast] 


75 


(P51 225) Thioredoxin. {GENE: TRXA} - Porphyra purpurea 
[Chloroplast] 


Thioredoxin h-type 


76 


(P29448) Thioredoxin H-type 1 (TRX-H-1). {GENE: TRX1 OR 
AT3G51030 OR F24M12.70} - Arabidopsis thaliana (Mouse- 
ear cress) 


77 


(P20857) Thioredoxin 2 (TRX-2). {GENE: TRXB} - Anabaena 
sp. (strain PCC 7120) 


78 


(Q42388) Thioredoxin H-type 1 (TRX-H-1) (Pollen coat 
protein). {GENE: THL-1 OR B0PC17} - Brassica napus (Rape), 
Brassica oleracea (Cauliflower) 


79 


(P29449) Thioredoxin H-type 1 (TRX-H1). - Nicotians tabacum 
(Common tobacco) 


80 


(Q38879) Thioredoxin H-type 2 (TRX-H-2). {GENE: TRX2 OR 
AT5G39950 OR MYH19.14} - Arabidopsis thaliana (Mouse-ear 
cress) 


81 


(Q39362) Thioredoxin H-type 2 (TRX-H-2). {GENE: THL-2} - 
Brassica napus (Rape) 


82 


(Q07090) Thioredoxin H-type 2 (TRX-H2). - Nicotiana tabacum 
(Common tobacco) 


83 


(Q42403) Thioredoxin H-type 3 (TRX-H-3). {GENE: TRX3 OR 
AT5G42980 OR MBD2.18} - Arabidopsis thaliana (Mouse-ear 
cress) 


84 


(Q39239) Thioredoxin H-type 4 (TRX-H-4). {GENE: TRX4} - 
Arabidopsis thaliana (Mouse-ear cress) 


85 j 


(Q39241) Thioredoxin H-type 5 (TRX-H-5). {GENE: TRX5} - 
Arabidopsis thaliana (Mouse-ear cress) 


86 


(064432) Thioredoxin H-type (TRX-H). {GENE: PEC-2} - 
Brassica rapa (Turnip) • j 
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SEQ. (D 
NO. 


SWISS PROTEIN IDENTIFIER 
(in parenthesis) 


EXAMPLES OF REDOX PROTEINS 


87 


(P80028) Thioredoxin H-type (TRX-H) (Thioredoxin CH1). 

/GFNF' TRXHl - Chlamvdomnna<5 rpinharHtii 

i \j limu. i ij v> i nai ii yuwi i iui lao idiiiioivli.ll 


88 


(Q96419) Thioredoxin H-type (TRX-H). - Fagopyrum 
esculentum (Common buckwheat) 


89 


(Q42443) Thioredoxin H-type (TRX-H) (Phloem sap 13 kDa 
protein-1). - Oryza sativa (Rice) 


90 


(065049) Thioredoxin H-type (TRX-H). {GENE: SB09} - Picea 
mariana (Black spruce) 


91 


(Q43636) Thioredoxin H-type (TRX-H). - Ricinus communis 
(Castor bean) 


92 


(064394) Thioredoxin H-type (TRX-H) (TrxTa). - Triticum 
aestivum (Wheat) 


93 


(P29429) Thioredoxin. - Emericella nidulans (Aspergillus 
nidulans) 


VIRUSES, BACTERIA AND FUNGI THIOREDOXINS 


94 


(P80579) Thioredoxin (TRX). {GENE: TRXA} - Alicyclobacillus 
acidocaldarius (Bacillus acidocaldarius) 


95 


(028137) Thioredoxin. {GENE: AF2145} - Archaeoglobus 
fulgidus 


96 


(P14949) Thioredoxin (TRX). {GENE: TRXA OR TRX} - Bacillus 
subtilis 


97 


(P00276) Thioredoxin. {GENE: NRDC} - Bacteriophage T4 


98 


(051088) Thioredoxin (TRX). {GENE: TRXA OR BB0061} - 
Borrelia burgdorferi (Lyme disease spirochete) 


99 


(P57653) Thioredoxin (TRX). {GENE: TRXA OR BU597} - 
Buchnera aphidicola (subsp. Acyrthosiphon pisum) | 
(Acyrthosiphon pisum symbiotic bacterium) 


100 


(051890) Thioredoxin (TRX). {GENE: TRXA} - Buchnera 
aphidicola (subsp. Schizaphis graminum) ■ 


101 


(P10472) Thioredoxin (TRX). {GENE: TRXA} - Chlorobium 
limicola f.sp. thiosulfatophilum 



wo 



02/50289 



PCT/US01/50240 



-109- 



SEQ. ID 
NO. 


SWISS PROTEIN IDENTIFIER 

(in parenthesis) 


EXAMPLES OF REDOX PROTEINS 


102 


(Q9PJK3) Thioredoxin (TRX). {GENE: TRXA OR TC0826} - 
Chlamydia muridarum 


103 


(Q9Z7P5) Thioredoxin (TRX). {GENE: TRXA OR CPN0659 OR 
CP0088} - Chlamydia pneumoniae (Chlamydophila 
pneumoniae) 


104 


(P52227) Thioredoxin (TRX). {GENE: TRXA} - Chlamydia 
psittaci (Chlamydophila psittaci) 


105 


(084544) Thioredoxin (TRX). {GENE: TRXA OR CT539} - 
Chlamydia trachomatis 


106 


(P00275) Thioredoxin C-1 . - Corynebacterium nephridii 


107 


(P07887) Thioredoxin C-2. - Corynebacterium nephridii 


108 


(P52228) Thioredoxin C-3. - Corynebacterium nephridii 


109 


(P09857) Thioredoxin (TRX). {GENE: TRXA} - Chromatium 
vinosum 


110 


(P21609) Thioredoxin (TRX). {GENE: TRXA} - Clostridium 
litorale (Bacterium Wo) 


111 


(P81 108) Thioredoxin (TRX) (Fragment). {GENE: TRXA} - 
Clostridium sporogenes 


112 


(P81109) Thioredoxin (TRX) (Fragment). {GENE: TRXA} - 
Clostridium sticklandii 


113 


(Q9UW02) Thioredoxin (Allergen Cop c 2). - Coprinus comatus 
(Shaggy mane) 


114 


(P29445) Thioredoxin 1. {GENE: TRXA OR TRX1} - 
Dictyostelium discoideum (Slime mold) 


115 


(P29446) Thioredoxin 2 (Fragment). {GENE: TRXB OR TRX2} - 
Dictyostelium discoideum (Slime mold) 


116 


(P29447) Thioredoxin 3. {GENE: TRXC OR TRX3} - 
Dictyostelium discoideum (Slime mold) 


117 


(P00274) Thioredoxin 1 (TRX1) (TRX). {GENE: TRXA OR 
TSNC OR FIPA OR B3781} - Escherichia coli, Salmonella 
typhimurium 
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SEQ. ID 
NO. 


SWISS PROTEIN IDENTIFIER 
(in parenthesis) 


EXAMPLES OF REDOX PROTEINS 


118 


(P52232) Thioredoxin-like protein SLR0233. {GENE: SLR0233} 
- Synechocystis sp. (strain PCC 6803) 


119 


(P33636) Thioredoxin 2 (Trx2). {GENE: TRXC OR B2582 OR 
Z3867 OR ECS3448} - Escherichia coli, Escherichia coli 
0157.-H7 


120 


(P21 610) Thioredoxin (TRX). {GENE: TRXA} - Eubacterium 
acidaminophilum 


121 


(P43785) Thioredoxin (TRX). {GENE: TRXA OR TRXM OR 
HI0084} - Haemophilus influenzae 


122 


(P43787) Thioredoxin-like protein HI1 115. {GENE: HI1 115}- 
Haemophilus influenzae 


123 


(P56430) Thioredoxin (TRX). {GENE: TRXA OR HP0824 OR 
JHP0763} - Helicobacter pylori (Campylobacter pylori), 
Helicobacter pylori J99 (Campylobacter pylori J99) 


124 


(Q9S386) Thioredoxin (EC 1 .6.4.5) {GENE:TRXA} - Listeria 
monocytogenes 


125 


(Q57755) Thioredoxin. {GENE: TRX OR MJ0307} - 
Methanococcus jannaschii 


126 


(P47370) Thioredoxin (TRX). {GENE: TRXA OR TRX OR 
MG124} - Mycoplasma genitalium 


127 


(P46843) Bifunctional thioredoxin-reductase/thioredoxin 
[Includes: Thioredoxin-reductase (EC 1 .6.4.5) (TRXR); 
Thioredoxin]. {GENE: TRXB/A OR TRX OR ML2703} - 
Mycobacterium leprae 


128 | 


(P75512) Thioredoxin (TRX). {GENE: TRXA OR TRX OR 
MPN263 OR MP570} - Mycoplasma pneumoniae 


129 


(030974) Thioredoxin (TRX). {GENE: TRXA} - Mycobacterium 
smegmatis 


130 


(P52229) Thioredoxin (TRX) (MPT46). {GENE: TRXA OR TRX 
OR TRXC OR RV3914 OR MT4033 OR MTV028.05} - 
Mycobacterium tuberculosis || 


131 


(P42115) Thioredoxin. {GENE: TRX} - Neurospora crassa || 
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SEQ. ID 
NO. 


SWISS PROTEIN IDENTIFIER 
(in parenthesis) 


EXAMPLES OF REDOX PROTEINS 


132 


(P34723) Thioredoxin. {GENE: TRXA} - Penicillium 
chrysogenum 


133 


(Q9X2T1) Thioredoxin (TRX). {GENE: TRXA OR TRX OR 
PA5240} - Pseudomonas aeruginosa 


134 


(P10473) Thioredoxin (TRX). {GENE: TRXA} - Rhodospirillum 
rubrum 


135 


(PO8058) Thioredoxin (TRX). {GENE: TRXA} - Rhodobacter 
sphaeroides (Rhodopseudomonas sphaeroides) 


136 


(Q9ZEE0) Thioredoxin (TRX). {GENE: TRXA OR RP002} - 
Rickettsia prowazekii 


137 


(P33791) Thioredoxin (TRX) (Fragment). {GENE: TRXA} - 
Streptomyces aureofaciens 


138 


(P52230) Thioredoxin (TRX). {GENE: TRXA OR SCH24.11C} - 
Streptomyces coelicolor 


139 


(Q05739) Thioredoxin (TRX). {GENE: TRXA} - Streptomyces 

CIS VulfycrUS 


140 


(P52231) Thioredoxin (TRX). {GENE: TRXA OR SLR0623} - 
Synechocystis sp. (strain PCC 6803) 


141 


(P73263) Thioredoxin-like protein SLR1 139. {GENE: SLR1 139} 
- Synechocystis sp. (strain PCC 6803) 


142 


(P52233) Thioredoxin (TRX). {GENE: TRXA} - Thiobacillus 
ferrooxidans 


143 


(P96132) Thioredoxin (TRX) (Fragment). {GENE: TRXA} - 
Thiocapsa roseopersicina 


144 


(P81 110) Thioredoxin (TRX) (Fragment). {GENE: TRXA} - 
Tissierella creatinophila 


145 


(083889) Thioredoxin (TRX). {GENE: TRXA OR TP0919} - 
Treponema pallidum 
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SEQ. ID 
NO. 


SWISS PROTEIN IDENTIFIER 
(in parenthesis) 


EXAMPLES OF REDOX PROTEINS 


ANIMAL THIOREDOXIN 


146 


(097680) Thioredoxin. {GENE: TXN} - Bos taurus (Bovine) 


147 


(Q95108) Thioredoxin, mitochondrial precursor (MT-TRX). 
{GENE: TXN2} - Bos taurus (Bovine) 


148 


(Q09433) Thioredoxin. {GENE: B0228.5} - Caenorhabditis 
elegans 


149 


(P99505) Thioredoxin (Fragment). {GENE: TXN} - Canis 
familiaris (Dog 


150 


(P08629) Thioredoxin. {GENE: TXN} - Gallus gallus (Chicken) 


151 


(P47938) Thioredoxin (Deadhead protein). {GENE: DHD OR 
CG41 93} - Drosophila melanogaster (Fruit fly) 


152 


(P1 0599) Thioredoxin (ATL-derived factor) (ADF) (Surface 
associated sulphydryl protein) (SASP). {GENE: TXN OR TRDX 
wn i nAj norno sapiens \nurnanj 


153 


(Q99757) Thioredoxin, mitochondrial precursor (MT-TRX). 
^ocimc. i aimzj numo sapiens inurnanj 


1 RA 


lr<&,57*rul/ 1 i HOI cUUXlll. \VJCIMC 1 AIM/ IVIaCaCa JTlUldlLd \rUieSUb 

macaque) 


155 


(P1 0639) Thioredoxin (ATL-derived factor) (ADF). {GENE: 
TXN} - Mus musculus (Mouse) 


156 


(P97493) Thioredoxin, mitochondrial precursor (MT-TRX). 
{GENE: TXN2} - Mus musculus (Mouse) 


157 


(P82460) Thioredoxin (Fragment). {GENE: TXN} - Sus scrofa 
(Pig) 


158 


(P08628) Thioredoxin. {GENE: TXN} - Oryctolagus cuniculus 
(Rabbit) 


159 


(P11232) Thioredoxin. {GENE: TXN} - Rattus norvegicus (Rat) 


160 


(P97615) Thioredoxin, mitochondrial precursor (MT-TRX). 
{GENE: TXN2 OR TRX2} - Rattus norvegicus (Rat) 


161 


(P50413) Thioredoxin. {GENE: TXN} - Ovis aries (Sheep) 


PLANTS THIOREDOXIN-UKE PROTEINS 
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SEQ. ID 
NO. 


SWISS PROTEIN IDENTIFIER 
(in parenthesis) 


EXAMPLES OF REDOX PROTEINS 


162 


(023166) THIOL-DISULFIDE INTERCHANGE LIKE PROTEIN 
(THIOREDOXIN-LIKE PROTEIN) {GENE:C7A10.160 OR 
AT4G37200 OR HCF1 64} - Arabidopsis thaliana (Mouse-ear 
cress) 


163 


(Q9C9Y6) Thioredoxin-like protein {GENE:F17014.18} - 
Arabidopsis thaliana (Mouse-ear cress) 


164 


(Q9FYD5) Thioredoxin-like protein {GENE:F21E1 J 80} - 
Arabidopsis thaliana (Mouse-ear cress) 


165 


(Q38878) THIOREDOXIN-LIKE PROTEIN {GENE:TRX6 OR 
T7D17.3} - Arabidopsis thaliana (Mouse-ear cress) 


166 


(Q9LVI2) Thioredoxin-like protein - Arabidopsis thaliana 
(Mouse-ear cress) 


167 


(Q9SCN9) Thioredoxin-like protein {GENE:T4D2.1 50} - 
Arabidopsis thaliana (Mouse-ear cress) 


168 


(Q9SRD7) Thioredoxin-like protein, 49720-48645 
{GENE:F28016.13} - Arabidopsis thaliana (Mouse-ear cress) 


169 


(Q9SU84) THIOREDOXIN-LIKE PROTEIN {GENE:T16L4.180 OR 
AT4G29670} - Arabidopsis thaliana (Mouse-ear cress) 


170 


(Q9SWG6) Thioredoxin-like protein {GENErTRX} - Hordeum 

DUIDOSUm 


171 


(Q9SWG4) Thioredoxin-like protein {GENE:TRX} - Lolium 
perenne (Perennial ryegrass) 


172 


(Q9AS75) Thioredoxin-like protein {GENE:P0028E10.17} - 
Oryza sativa (Rice) 


173 


(004002) CDSP32 protein (Chloroplast Drought-induced Stress 
Protein of 32kDa) - Solanum tuberosum (Potato) 


174 


(Q9SWG5) Thioredoxin-like protein {GENE:TRX} - Secale 
cereale (Rye) 


175 i 


(Q9SP36) Thioredoxin-like protein (Fragment) {GENErTRX} - 
Secale cereale (Rye) 


176 


(Q9U515) Thioredoxin-like protein - Manduca sexta (Tobacco 
hawkmoth) (Tobacco hornworm) 
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SEQ. ID 
NO. 


SWISS PROTEIN IDENTIFIER 
(in parenthesis) 


EXAMPLES OF REDOX PROTEINS 


VIRUSES, BACTERIA AND FUNGI THIOREDOXIN-LIKE PROTEINS 


177 


(P43221) Thiolcdisulfide interchange protein tlpA {Cytochrome 
c biogenesis protein tlpA). {GENE: TLPA} - Bradyrhizobium ] 
japonicum 


178 


(P43787) Thioredoxin-like protein HI1 1 15. {GENE: HI1 1 1 5} - 
Haemophilus influenzae 


179 


(Q9GUP7) Thioredoxin-like protein {GENE:TRXLP} - Leishmania 
major 


180 


(Q9UVH0) Thioredoxin-like protein - Mortierella alpina 


181 


(P95355) Thioredoxin-like protein - Neisseria gonorrhoeae 


182 


(Q98G37) Thioredoxin-like protein {GENE:MLL3505} - 
Rhizobium loti (Mesorhizobium loti) 


183 


(P36893) Thiol :disulfide interchange protein helX precursor 
(Cytochrome c biogenesis protein helX). {GENE: HELX} - 
Rhodobacter capsulatus (Rhodopseudomonas capsulata) 


184 


(P52232) Thioredoxin-like protein SLR0233. {GENE: SLR0233} 
- Synechocystis sp. (strain PCC 6803) 


185 


(P73263) Thioredoxin-like protein SLR1139. {GENE: SLR1139} 
- Synechocystis sp. (strain PCC 6803) 


186 


(Q9USR1) Thioredoxin-like protein {GENE:SPBC577.08C} - 
Schizosaccharomyces pombe (Fission yeast) 


187 


(Q9R788) Thioredoxin {GENE.TPTRX} - Treponema pallidum 


ANIMALS THIOREDOXIN-LIKE PROTEINS 


188 


(Q9UAV4) F46E10.9 PROTEIN (THIOREDOXIN-LIKE PROTEIN 
DPY-1 1) {GENE:F46E10.9 OR DPY-1 1} - Caenorhabditis 
elegans 


189 


(Q9N2K6) Thioredoxin-like protein (Y54E10A.3 protein) 
(Thioredoxin-like protein TXL) {GENE:TXL OR Y54E10A.3} - 
Caenorhabditis elegans 


190 


(Q9VRP3) THIOREDOXIN-LIKE PROTEIN TXL (CG5495 
PROTEIN) {GENE:TXL OR CG5495} - Drosophila melanogaster 
(Fruit fly) 
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SEQ. ID 
NO. 


SWISS PROTEIN IDENTIFIER 

(in parenthesis) 


EXAMPLES OF REDOX PROTEINS 


191 


(043396) Thioredoxin-like protein (32 kDa thioredoxin-related i 
protein). {GENE: TXNL OR TRP32 OR TXL} - Homo sapiens 
(Human) 


192 


(076003) Thioredoxin-like protein - Homo sapiens (Human) 


193 


(Q9S753) THIOREDOXIN-LIKE PROTEIN {GENE:TRX} - Phalaris 
coerulescens 


194 


(O77404) TRYPAREDOXIN - Trypanosoma brucei brucei 


PLANT THIOREDOXIN-REDUCTASES 


195 


(Q39243) Thioredoxin-reductase 1 (EC 1.6.4.5) (NADPH- 
dependent thioredoxin-reductase 1) (NTR 1). {GENE: NTR1 OR 
AT4G35460 OR F1 5J1 .30} - Arabidopsis thaliana (Mouse-ear 
cress) 


196 


(Q39242) Thioredoxin-reductase 2 (EC 1.6.4.5) (NADPH- 
dependent thioredoxin-reductase 2) (NTR 2). {GENE: NTR2 OR 
AT2G17420 OR F5J6.18} - Arabidopsis thaliana (Mouse-ear 
cress) 


VIRUSES, BACTERIA AND FUNGI THIOREDOXIN-REDUCTASES 


197 


(O66790) Thioredoxin-reductase (EC 1 .6.4.5) (TRXR). {GENE: 
TRXB OR AQ 500} - Aquifex aeolicus 


198 


(P80880) Thioredoxin-reductase (EC 1.6.4.5) (TRXR) (General 
stress protein 35) (GSP35). {GENE: TRXB} - Bacillus subtilis 


199 


(P94284) Thioredoxin-reductase (EC 1 .6.4.5) (TRXR). {GENE: 
TRXB OR BB051 5} - Borrelia burgdorferi (Lyme disease 
spirochete) 


200 


(P57399) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB OR BU314} - Buchnera aphidicola (subsp. Acyrthosiphon 
pisum) (Acyrthosiphon pisum symbiotic bacterium) 


201 


(P81 433) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB} - Buchnera aphidicola (subsp. Schizaphis graminum) 


202 


(Q9PKT7) Thioredoxin-reductase (EC 1 .6.4.5) (TRXR). {GENE: 
TRXB OR TC0375} - Chlamydia muridarum 



WO 02/50289 



PCT/US01/50240 



-116- 



SEQ. ID 
NO. 


SWISS PROTEIN IDENTIFIER j 
(in parenthesis) 


EXAMPLES OF REDOX PROTEINS 


203 


(Q9Z8M4) Thioredoxin-reductase (EC 1 6 4 5) (TRXR) fGENF* 
TRXB OR CPN0314 OR CP0444} - Chlamydia pneumoniae 
(Chlamydophila pneumoniae) 


204 


(084101) Thioredoxin-reductase {EC 1.6.4.5) (TRXR). {GENE: 
TRXB OR CT099} - Chlamydia trachomatis 


205 


(P522131 Thiorednyin-rpdurtasp VEC 1 fi d 5) (TRXR1 /fiFMF- 
TRXB} - Clostridium litorale (Bacterium W6) 


206 


(P39916) Thiorpdnyin-rpriurta<5P IFC 1 fi4R| /TRXRl /ttFMF- 

TRXB} - Coxiella burnetii 


207 


(P0962m Thinrpdnvih-rpriiirta<5p IPC 1 fiARl fTRyRl /"f^FMF* 
TRXB OR B0888 OR Z1232 OR ECS0973} - Escherichia coli, 
Escherichia coli 0157:H7 


208 


(P50971) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB} - Eubacterium acidaminophilum 


209 


(P43788) Thioredoxin-reductase (EC 1 .6.4.5) (TRXR). {GENE: 
TRXB OR HI1 1 58) - Haemophilus influenzae 

1 1 \mf v«/ 11 III 1 1 w \J I | lUwl 1 t\J f-Ji 1 1 1 U O II II IU v) l£. Gl vJ 


210 


(Q9ZL18) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB OR JHP07641 - Helicobacter Dvlori J99 (Camnvlohart#»r 
pylori J99) 


211 


(P56431) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB OR HP0825} - Helicobacter pylori (Campylobacter pylori) 


212 


(032823) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB OR LM02478} - Listeria monocytogenes 


213 


(P47348) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB OR MG102} - Mycoplasma genitalium 


214 


(P46843) Bifunctional thioredoxin-reductase/thioredoxin 
[Includes: Thioredoxin-reductase (EC 1.6.4.5) (TRXR); 
Thioredoxin]. {GENE: TRXB/A OR TRX OR ML2703} - 
Mycobacterium leprae 


215 


(P75531) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB OR MPN240 OR MP591} - Mycoplasma pneumoniae 


216 


(030973) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB} - Mycobacterium smegmatis 
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SEQ. ID 
NO. 


SWISS PROTEIN IDENTIFIER 

(in parenthesis) 


EXAMPLES OF REDOX PROTEINS 


217 


(P52214) Thioredoxin-reductase (EC 1.6.4.5) (TRXR) (TR). 
{GENE: TRXB OR RV3913 OR MT4032 OR MTV028.04} - 
Mycobacterium tuberculosis 


218 


(P51 978) Thioredoxin-reductase (EC 1.6.4.5). {GENE: CYS-9} 
- Neurospora crassa 


219 


(P43496) Thioredoxin-reductase (EC 1.6.4.5). {GENE: TRXB} - 
Penicillium chrysogenum 


220 


(Q9ZD97) Thioredoxin-reductase (EC 1 .6.4.5) (TRXR). {GENE: 
TRXB OR RP445} - Rickettsia prowazekii 


221 


(Q92375) Thioredoxin-reductase (EC 1.6.4.5). {GENE: 
SPBC3F6.03} - Schizosaccharomyces pombe (Fission yeast) 


222 


(OOH74.11 Thiorprinxin-rpHnrtflcsp IPC 1 fi 4- RWTRXR) ffiFNF- 

TRXB} - Streptomyces clavuligerus 


223 


(P52215) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB OR SCH24.12} - Streptomyces coelicolor 


224 


(083790) Thioredoxin-reductase (EC 1 .6.4.5) (TRXR). {GENE: 
TRXB OR TP0814} - Treponema pallidum 


225 


(P80892) Thioredoxin-reductase (EC 1 .6.4.5) (TRXR) 
(Fragment). {GENE: TRXB} - Vibrio fischeri 


226 


(P29509) Thioredoxin-reductase 1 (EC 1.6.4.5). {GENE: TRR1 j 
OR YDR353W OR D9476.5} - Saccharomyces cerevisiae 
(Baker's yeast) 


227 


(P38816) Thioredoxin-reductase 2 f mitochondrial precursor (EC 
1 .6.4.5). {GENE: TRR2 OR YHR106W} - Saccharomyces 
cerevisiae (Baker's yeast) 


ANIMAL THIOREDOXIN-REDUCTASES 


228 


(062768) Thioredoxin-reductase (EC 1.6.4.5). {GENE: 
TXNRD1} - Bos taurus (Bovine) 


229 


(Q1 7745) Thioredoxin-reductase (EC 1.6.4.5). {GENE: 
C06G3.7} - Caenorhabditis elegans 


230 


(Q1 6881) Thioredoxin-reductase (EC 1.6.4.5). {GENE: 
TXNRD1} - Homo sapiens (Human) 



WO 02/50289 



PCT/US01/50240 



-118- 



SEQ. ID 
NO. 


SWISS PROTEIN IDENTIFIER 

(in parenthesis) 


EXAMPLES OF REDOX PROTEINS 


231 


(Q25861) Thioredoxin-reductase (EC 1 .6.4.5)(TrxR). {GENE: 
TR OR GR} - Plasmodium falciparum (isolate FCH-5) 


Other thioredoxin-reductases 


PLANTS THIOREDOXIN-REDUCTASES 


232 


(022229) Thioredoxin-reductase {GENE:AT2G41 680} - 
Arabidopsis thaliana (Mouse-ear cress) 


233 


(Q39951) NADPH thioredoxin-reductase (Fragment) - 
Helianthus annuus (Common sunflower) 


VIRUSES, BACTERIA AND FUNGI THIOREDOXIN-REDUCTASES 


234 


(028718) THioredoxin-reductase (TRXB) {GENE:AF1554} - 
Archaeoglobus fulgidus 


235 


(Q9K703) Thioredoxin-reductase (NADPH) (EC 1 .6.4.5) 
{GENE:TRXB OR BH3571} - Bacillus halodurans 


236 


(Q9K7F3) Thioredoxin-reductase {GENE:BH3408} - Bacillus 
halodurans 


237 


(Q9KCZ0) Thioredoxin-reductase {GENE:BH1429} - Bacillus 
halodurans 


238 


(Q9KCZ1) Thioredoxin-reductase {GENE:BH1428} - Bacillus 
halodurans 


239 


(Q9PIY1) Thioredoxin-reductase (EC 1.6.4.5) {GENE:TRXB OR 
CJ0146} - Campylobacter jejuni 


240 I 


(Q9A4G3) Thioredoxin-reductase {GENE:CC2871} - 
Caulobacter crescentus 


241 ! 


(Q97EM8) Thioredoxin-reductase {GENE:CAC3082} - 
Clostridium acetobutylicum 


242 


(Q97IU2) Thioredoxin-reductase {GENE:CAC1548} - 
Clostridium acetobutylicum 


243 


(Q9EV96) Thioredoxin-reductase {GENE:TRXB} - Clostridium 
sticklandii 


244 


(Q9RSY7) THioredoxin-reductase {GENE:DR1982} - 
Deinococcus radiodurans 
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SEQ. ID 
NO. 


SWISS PROTEIN IDENTIFIER 

(in parenthesis) 


EXAMPLES OF REDOX PROTEINS 


245 


(030739) Thioredoxin-reductase (Fragment) - Enterococcus 
faecalis (Streptococcus faecalis) 


246 


(054535) Thioredoxin-reductase {GENE:TRXB OR TRXB1 2 
OR VNG6452G OR TRXB1J OR VNG6074G} - Halobacterium 
sp. (strain NRC-1) [Plasmid pNRC100, and Plasmid pNRC200] 


247 


(P82854) Thioredoxin-reductase (EC 1.6.4.5) {GENE:TRXB2} - 
Halobacterium sp. (strain NRC-1 ) 


248 


(Q9HN08) Thioredoxin-reductase {GENE:TXRB3 OR 
VNG2301G} - Halobacterium sp. (strain NRC-1) 


249 


(025779) THioredoxin-reductase (TRXB) {GENE:HP1 164} - 
Helicobacter pylori (Campylobacter pylori) 


250 


(086255) Thioredoxin-reductase {GENE:TRXB} - Klebsiella 
oxytoca 


251 


(Q9AEV9) Thioredoxin-reductase (Fragment) {GENE.TRXB} - 
Lactococcus lactis (subsp. lactis) (Streptococcus lactis) 


252 


(Q9CF34) Thioredoxin-reductase (EC 1 .6.4.5) {GENE:TRXB2} - 
Lactococcus lactis (subsp. lactis) (Streptococcus lactis) 


253 


(Q9CH02) Thioredoxin-reductase (EC 1.6.4.5) {GENE.TRXB 1} 
- Lactococcus lactis (subsp. lactis) (Streptococcus lactis) 


254 


(Q9ZFC8) Thioredoxin-reductase (Fragment) {GENETRXB} - 
Lactococcus lactis 


255 


(032822) Hypothetical 39.7 kDa protein (Fragment) - Listeria 
monocytogenes 


256 I 


(O26804) THioredoxin-reductase {GENE:MTH708} - 
Methanothermobacter thermautotrophicus 


257 


(P94397) Homologue of thioredoxin-reductase of Mycoplama 
genitalium {GENE:YCGT} - Bacillus subtilis 


258 


(Q98PK9) THioredoxin-reductase (EC 1.6.4.5) 
{GENE:MYPU_7130} - Mycoplasma pulmonis 


259 


(Q9JU23) Thioredoxin-reductase (EC 1 .6.4.5) {GENETRXB OR 
NMA1 538} - Neisseria meningitidis (serogroup A) , 
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SEQ. ID 
NO. 


SWISS PROTEIN IDENTIFIER 
(in parenthesis) 


EXAMPLES OF REDOX PROTEINS 


260 


(Q9JZ28) Thioredoxin-reductase {GENE:NMB1324} - Neisseria 
meningitidis (serogroup B) 


261 


(Q9I0M2) Thioredoxin-reductase 1 {GENE:TRXB1 ORPA2616} 
- Pseudomonas aeruginosa . 


262 


(Q9I592) Thioredoxin-reductase 2 {GENE.TRXB2 OR PA0849} 
- Pseudomonas aeruginosa 


263 


(Q9V0Q8) THioredoxin-reductase (TRXB) {GENE.TRXB OR 
PAB0500} - Pyrococcus abyssi 


264 


(Q9ZD33) THioredoxin-reductase (TRXB2) {GENE:RP514} - 
Rickettsia prowazekii 


265 


(054079) Thioredoxin-reductase (EC 1.6.4.5) {GENE:TRXB} - 
Staphylococcus aureus 


266 


(Q9RIS2) Thioredoxin-reductase {GENE:TRXB OR TRXB2} - 
Streptomyces coelicolor 


267 


(Q9K4L6) Thioredoxin-reductase {GENE:SC5F8.08C} - 
Streptomyces coelicolor 


268 


(Q97PY2) Thioredoxin-reductase {GENE:SP1458} - . 
Streptococcus pneumoniae 


269 


(Q9A0B5) Thioredoxin-reductase {GENE:SPY0850} - 
Streptococcus pyogenes 


270 


(Q97V69) Thioredoxin-reductase (trxB-2) (EC 1 .6.4.5) 
{GENE:TRXB-2} - Sulfolobus solfataricus 


271 


(Q97W27) Thioredoxin-reductase (trxB-3) (EC 1 .6.4.5) 
{GENE:TRXB-3} - Sulfolobus solfataricus 


272 


(Q97WJ5) Thioredoxin-reductase (trxB-1) (EC 1.6.4.5) 
{GENE:TRXB-1} - Sulfolobus solfataricus 


273 


(Q98I59) Thioredoxin-reductase {GENE:MLL2552} - Rhizobium 
loti (Mesorhizobium loti) 


I 274 


(Q98M06) Thioredoxin-reductase {GENE:MLL0792} - 
Rhizobium loti (Mesorhizobium loti) 



WO 02/50289 



PCTAJS01/50240 



-121- 



SEQ. ID 
NO. 


SWISS PROTEIN IDENTIFIER 
(in parenthesis) 


EXAMPLES OF REDOX PROTEINS 




\U^UnOv| OO KUd 1 niOi cUOAII 1 IcUUOldbtJ n W 1 vlU LvJVJ 

(FRAGMENT) {GENE:TRR1 AND YDR353W} - Saccharomyces 
cerevisiae (Baker's yeast) 


276 


(Q9ZEH4) THIOREDOXIN {GENE:TRXA OR SA0992} - 
Staphylococcus aureus, Staphylococcus aureus subsp. aureus 
N315 


277 


(Q9S1H1) Thioredoxin-reductase (Fragment) {GENErTRXB} - 
Staphylococcus xylosus 


278 


(Q9HJI4) Thioredoxin-reductase {GENE:TA0984} - j 
Thermoplasma acidophilum 


279 


(Q9WZX3) THioredoxin-reductase {GENE:TM0869} - 
Thermotoga maritima 


280 


(Q979K8) Thioredoxin-reductase {GENE.TVG1 183005} - 
Thermoplasma volcanium 


281 


(Q9PR71) Thioredoxin-reductase {GENE:TRXB OR UU074} - 
Ureaplasma parvum (Ureaplasma urealyticum biotype 1 ) 


282 


(Q9KSS4) Thioredoxin-reductase {GENE:VC1 182} - Vibrio 
cholerae 


283 


(Q9PDD1) Thioredoxin-reductase {GENE:XF1448} -Xylella 
fastidiosa 


284 


(Q9X5F7) Thioredoxin-reductase {GENE:TRXB1} - Zymomonas 
mobilis 


ANIMAL THIOREDOXIN-REDUCTASES 


285 


(Q9GKW9) Thioredoxin-reductase 3 (Fragment) {GENE:TRXR3} 
- Bos taurus (Bovine) 


286 


(Q9N2I8) Thioredoxin-reductase (EC 1.6.4.5) - Bos taurus 
(Bovine) 


287 


(Q9N2K1) Thioredoxin-reductase homolog - Caenorhabditis 
elegans 


288 


(Q9NJH3) Thioredoxin-reductase - Caenorhabditis elegans || 
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SEQ. ID 
NO. 


SWISS PROTEIN IDENTIFIER 
(in parenthesis) 


EXAMPLES OF REDOX PROTEINS 


289 


(Q9VNT5) CG11401 PROTEIN (THioredoxin-reductase 2) 
{GENE:TRXR-2 OR CG1 1401} - Drosophila melanogaster (Fruit 
fly) 


290 


(095840) Thioredoxin-reductase - Homo sapiens (Human) 


291 


(Q9UES8) Thioredoxin-reductase GRIM-12 - Homo sapiens 
(Human) 


292 


(Q9UH79) Thioredoxin-reductase {GENE:TR} - Homo sapiens 
(Human) 


293 


(Q9UQU8) Thioredoxin-reductase - Homo sapiens (Human) 


294 


(Q9NNW6) Thioredoxin-reductase TR2 (Fragment) - Homo | 
sapiens (Human) 


295 


(Q9NNW7) Thioredoxin-reductase TR3 - Homo sapiens 
(Human) 


296 


(Q9P1 01) Thioredoxin-reductase 3 (Fragment) {GENE:TRXR3} 
- Homo sapiens (Human) 


297 


(Q9P2Y0) Thioredoxin-reductase II beta (EC 1 .6.4.5) - Homo 
sapiens (Human) 


298 


(Q9H2Z5) Mitochondrial thioredoxin-reductase 
{GENE:TRXR2A} - Homo sapiens (Human) 


299 


(Q99475) KM-102-DERIVED REDUCTASE-LIKE FACTOR 
(THioredoxin-reductase) - Homo sapiens (Human) 


300 


(Q99P49) Thioredoxin-reductase 1 {GENE:TXNRD1} - Mus 
musculus (Mouse) 


301 


(Q9CSV5) Thioredoxin-reductase 1 (Fragment) 
{GENE:TXNRD1} - Mus musculus (Mouse) 


302 


(Q9CZE5) Thioredoxin-reductase 1 {GENE:TXNRD1} - Mus 
musculus (Mouse) 


303 


(Q9JHA7) Thioredoxin-reductase TR3 {GENE:TXNRD2 OR 
TR3} - Mus musculus (Mouse) 


304 


(Q9JLT4) Thioredoxin-reductase {GENE:TXNRD2 OR TRXR2} - 
Mus musculus (Mouse) 
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SEQ. ID 
NO. 


SWISS PROTEIN IDENTIFIER 
(in parenthesis) 


EXAMPLES OF REDOX PROTEINS 


305 


(Q9JMH5) Thioredoxin-reductase 2 {GENE:TXNRD2 OR 
TXNRD2} - Mus musculus (Mouse) 


306 


(Q9JMH6) Thioredoxin-reductase 1 {GENE:TXNRD1 OR 
TXNRD1} - Mus musculus (Mouse) 


307 


(089049) Thioredoxin-reductase - Rattus norvegicus (Rat) 


308 


(Q9JKZ3) Thioredoxin-reductase 1 (Fragment) - Rattus 
norvegicus (Rat) 


309 


(Q9JKZ4) Thioredoxin-reductase 1 - Rattus norvegicus (Rat) 


310 


(Q9JLE6) Thioredoxin-reductase (Fragment) - Rattus 
norvegicus (Rat) 


311 


(Q9R1I3) NADPH-dependent thioredoxin-reductase 
{GENE:TRR1 } - Rattus norvegicus (Rat) 


312 


(Q9Z0J5) Thioredoxin-reductase precursor {GENE:TRXR2} - 
Rattus norvegicus (Rat) 


313 


(Q9MYY8) Redox enzyme thioredoxin-reductase - Sus scrofa 
(Pig) 



10 
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WHAT IS CLAIMED IS: 

1 . A method of producing an oil body associated with a recombinant 
multimeric-protein-complex, said method comprising: 

(a) producing in a cell comprising oil bodies, a first recombinant polypeptide and 
5 a second recombinant polypeptide wherein said first recombinant polypeptide is 

capable of associating with said second recombinant polypeptide to form said 
multimeric-protein-complex; and 

(b) associating said multimeric-protein-complex with an oil body through an oil- 
body-targeting-protein capable of associating with said oil body and said first 

10 recombinant polypeptide, 

2. The method of claim 1 further comprising (c) isolating said oil 
bodies associated with said recombinant multimeric-protein-complex. 

3. The method of claim 1 wherein said multimeric-protein-complex 
associates with oil bodies obtainable from said cell comprising oil bodies. 

15 4. The method of claim 1 wherein said multimeric-protein-complex 

associates intracellular^ with said oil bodies. 

5. The method of claim 1 wherein said second recombinant 

polypeptide is associated with a second oil-body-targeting-protein capable of 

associating with an oil body and said second recombinant polypeptide, 
20 6. The method of claim 5 wherein each of said oil-body-targeting- 

proteins is an oil-body-protein or an immunoglobulin. 

7. The method of claim 6 wherein said oil-body-targeting-protein is an 
oleosin or caleosin. 

8. The method of claim 1 wherein said oil-body-targeting-protein is an 
25 oleosin or caleosin and said first recombinant polypeptide is fused to said oleosin 

or caleosin. 

9. The method of claim 8 wherein said second recombinant 
polypeptide is fused to a second oleosin or second caleosin capable of 
associating with an oil body. 

30 10. The method of claim 1 wherein said first and second recombinant 

polypeptides are produced as a multimeric-fusion-protein comprising said first 
and second recombinant polypeptide. 
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1 1. The method of claim 1, wherein said multimeric-protein-complex is a 
heteromultimeric-protein-complex. 

1 2. The method of claim 1 1 wherein said heteromultimeric-protein- 
complex is an enzymatically active redox complex or an immunoglobulin. 

5 13. The method of claim 1, wherein said first recombinant polypeptide is 

capable of associating with said second recombinant polypeptide in the cell. 

14. The method of claim 1 wherein said first recombinant polypeptide is 
a thioredoxin and said second recombinant polypeptide is a thioredoxin- 
reductase. 

10 15. The method of claim 14, wherein said thioredoxin is selected from 

the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID N0s:52-194. 

1 6. The method of claim 1 4, wherein said thioredoxin-reductase is 
selected from the group consisting of those set forth in SEQ ID N0s:8, 9, 10, 
40, 44, 48, 50 and SEQ ID NOs:195-313. 

15 17. The method of claim 1 wherein said cell is a plant cell. 

18. The method of claim 1 wherein said cell is a safflower cell. 

1 9. The method of claim 1 wherein said first recombinant polypeptide is 
an immunoglobulin-polypeptide-chain. 

20. The method of claim 1 wherein said first recombinant polypeptide is 
20 an immunoglobulin light chain, or an immunologically active portion thereof, and 

said second recombinant polypeptide is an immunoglobulin heavy chain, or an 
immunologically active portion thereof. 

21. The method of claim 19 wherein said oil-body targeting-protein 
comprises protein A, protein L or protein G. 

25 22. A method of expressing a recombinant multimeric-protein-complex 

comprising a first and second recombinant polypeptide in a cell, said method 
comprising: 

(a) introducing into a cell a first chimeric nucleic acid sequence comprising: 

(i) a first nucleic acid sequence capable of regulating transcription 
30 in said cell operatively linked to; 

(ii) a second nucleic acid sequence encoding a first recombinant 

polypeptide; 
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(b) introducing into said cell a second chimeric nucleic acid sequence comprising: 

(i) a third nucleic acid sequence capable of regulating transcription 
in said cell operatively linked to; 

(ii) a fourth nucleic acid sequence encoding a second recombinant 

5 polypeptide; 

(c) growing said cell under conditions to permit expression of said first and 
second recombinant polypeptide in a progeny cell comprising oil bodies wherein 
said first recombinant polypeptide and said second recombinant polypeptide are 
capable of forming a multimeric-protein-complex; and 

10 (d) associating said first recombinant polypeptide with an oil body through an oil- 

body-targeting-protein capable of associating with said oil body and said first 

recombinant polypeptide. 

23. The method of claim 22 further comprising (e) isolating from said 

progeny cell, oil bodies comprising said multimeric-protein-complex. 
15 24. The method of claim 22 wherein said multimeric-protein-complex 

associates with said oil bodies obtainable from said progeny cell comprising oil 

bodies. 

25. The method of claim 22 wherein said oil bodies associate 
intracellular^ with said multimeric-protein-complex. 
20 26. The method of claim 22 wherein said second recombinant 

polypeptide is associated with a second oil-body-targeting-protein capable of 
associating with an oil body and said second recombinant polypeptide. 

27. The method of claim 26 wherein each of said oil-body-targeting- 
proteins is selected from an oil-body-protein or an immunoglobulin. 
25 28. The method of claim 27 wherein- said oil-body-protein is an oleosin or 

caleosin. 

29. The method of claim 28 wherein said first recombinant polypeptide is 
fused to said oleosin or caleosin. 

30. The method of claim 29 wherein said second recombinant 
polypeptide is fused to a second oleosin or second caleosin capable of 
associating with an oil body. 
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31 . The method of claim 22 wherein said first and second recombinant 
polypeptide are produced as a multimeric-fusion-protein comprising said first and 
second recombinant polypeptide. 

32. The method of claim 22 wherein said first and second recombinant 
5 polypeptide form a multimeric-protem-compiex. 

33. The method of claim 32, wherein said multimeric-protein-complex is 
a heteromultimeric-protein-complex. 

34. The method of claim 32 wherein said heteromultimeric-protein- 
complex is an enzymaticaily active redox complex or an immunoglobulin. 

10 35. The method of claim 22 wherein said first recombinant polypeptide 

and said second recombinant polypeptide are capable of forming a multimeric- 
protein-complex in said progeny cell. 

36. The method of claim 22 wherein said first recombinant polypeptide is 
a thioredoxin and said second recombinant polypeptide is a thioredoxin- 

15 reductase. 

37. The method of claim 36, wherein said thioredoxin is selected from 
the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID NOs:52-194. 

38. The method of claim 36, wherein said thioredoxin-reductase is 
selected from the group consisting of those set forth in SEQ ID N0s:8, 9, 10, 

20 40, 44, 48, 50 and SEQ ID NOs: 195-3 13. 

39. The method of claim 22 wherein said first recombinant polypeptide is 
an immunoglobulin-polypeptide-chain, 

40. The method of claim 22 wherein said first recombinant polypeptide is 
an immunoglobulin light chain, or an immunologically active portion thereof, and 

25 said second recombinant polypeptide is an immunoglobulin heavy chain, or an 
immunologically active portion thereof. 



41- The method of claim 39 wherein said oil-body targeting-protein 
comprises protein A, protein L or protein G. 
30 42. The method of claim 22 wherein said cell is a plant cell. • 

43. The method of claim 42 wherein said plant cell is a saf flower cell. 
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44. A method of producing in a plant a recombinant multimeric-protein- 
complex, said method comprising: 

(a) preparing a first plant comprising cells, said cells comprising oil bodies and a 
first recombinant polypeptide wherein said first recombinant polypeptide is 

5 capable of associating with said oil bodies through an oil-body-targeting-protein; 

(b) preparing a second plant comprising cells, said cells comprising oil bodies and 
a second recombinant polypeptide; and 

(c) sexually crossing said first plant with said second plant to produce a progeny 
plant comprising cells, said cells comprising oil bodies, wherein said oil bodies 

10 are capable of associating with said first recombinant polypeptide, and said first 
recombinant recombinant polypeptide is capable of associating with said second 
recombinant polypeptide to form said recombinant multimeric-protein-complex. 

45. The method of claim 44 wherein said second recombinant 
polypeptide is capable of associating with oil bodies through an oil-body- 

15 targeting-protein in said second plant. 

46. The method of claim 44 further comprising (d) isolating from said 
progeny plant oil bodies comprising said multimeric-protein-complex. 

47. The method of claim 44 wherein said oil-body-targeting-protein is 
selected from an oil-body-protein or an immunoglobulin. 

20 48. The method of claim 47 wherein said oil-body-protein is an oleosin or 

caleosin. 

49. The method of claim 48 wherein said first recombinant polypeptide is 
fused to said oleosin or caleosin. 

50. The method of claim 49 wherein said second recombinant 
polypeptide is fused to a second oleosin or second caleosin capable of 
associating with an oil body. 

51. The method of claim 44 wherein said first and second recombinant 
polypeptide form a multimeric-protein-complex. 

52. The method of claim 51, wherein said multimeric-protein-complex is 
a heteromultimeric-protein-complex. 

53. The method of claim 52 wherein said heteromultimeric-protein- 
complex is an enzymatically active redox complex or an immunoglobulin. 
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54. The method of claim 44 wherein said first recombinant polypeptide is 
a thioredoxin and said second recombinant polypeptide is a thioredoxin- 
reductase. 

55. The method of claim 54, wherein said thioredoxin is selected from 
5 the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID N0s:52-194. 

56. The method of claim 54, wherein said thioredoxin-reductase is 
selected from the group consisting of those set forth in SEQ ID N0s:8, 9, 10, 
40, 44, 48, 50 and SEQ ID N0s:195-313. 

57. The method of claim 44 wherein said first recombinant polypeptide is 
10 an immunoglobulin-polypeptide-chain. 

58.. The method of claim 44 wherein said first recombinant polypeptide is 
an immunoglobulin light chain, or an immunologically active portion thereof, and 
said second recombinant polypeptide is an immunoglobulin heavy chain, or an 
immunologically active portion thereof. 
15 59. The method of claim 57 wherein said oil-body targeting-protein 

comprises protein A, protein L or protein G. 

60. The method of claim 44 wherein said plant is safflower. 

61. A chimeric nucleic acid sequence encoding a multimeric-fusion- 
protein, said nucleic acid comprising: 

20 (a) a first nucleic acid sequence encoding an oil-body-targeting-protein 
operatively linked in reading frame to; 

(b) a second nucleic acid sequence encoding a first recombinant polypeptide; 
linked in reading frame to; 

(c) a third nucleic acid sequence encoding a second recombinant polypeptide, 
25 wherein said first and second recombinant polypeptide are capable of forming a 

multimeric-protein-complex. 

62. The nucleic acid of claim 61, wherein said oil-body-targeting-protein 
is selected from an oil-body-protein or an immunoglobulin. 

63. The nucleic acid of claim 62, wherein said oil-body-protein is an 
30 oleosin or caleosin. 

64. The nucleic acid of claim 63, wherein said multimeric-protein- 
complex is a heteromultimeric-protein-complex. 
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65. The chimeric nucleic acid sequence of claim 61 wherein said first 
and second recombinant polypeptide form an enzymatically active 
heteromultimeric redox complex or an immunoglobulin. 

66. The chimeric nucleic acid sequence of claim 65 wherein said first 
5 and second recombinant polypeptides are a thioredoxin and a thioredoxin- 

reductase. 

67. The chimeric nucleic acid of claim 66, wherein said thioredoxin is 
selected from the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID 
N0s:52-194. 

10 68. The chimeric nucleic acid of claim 66, wherein said thioredoxin- 

reductase is selected from the group consisting of those set forth in SEQ ID 
NOs:8, 9, 10, 40, 44, 48, 50 and SEQ ID NOs:1 95-313. 

69. The chimeric nucleic acid of claim 65 wherein said first recombinant 
polypeptide is an immunoglobulin-polypeptide-chain. 

15 70. The chimeric nucleic acid of claim 65 wherein said first recombinant 

polypeptide is an immunoglobulin light chain, or an immunologically active 
portion thereof, and said second recombinant polypeptide is an Immunoglobulin 
heavy chain, or an immunologically active portion thereof. 

71. The chimeric nucleic acid of claim 69 wherein said oil-body targeting- 
20 protein comprises protein A, protein L or protein G. 

72. The nucleic acid of claim 61, wherein positioned between said 
nucleic acid sequence encoding an oil-body-targeting-protein and said nucleic 
acid sequence encoding a first recombinant polypeptide is a linker nucleic acid 
sequence encoding an oil-body-surface-avoiding linker amino acid sequence. 

25 73. The nucleic acid of claim 72, wherein said oil-body-surface-avoiding 

linker amino acid sequence is substantially negatively charged, or has a 
molecular weight of at least 35 kd. 

74. The nucleic acid of claim 73, wherein the gene fusion further 
comprises a linker nucleic acid sequence encoding an amino acid sequence that 
30 is specifically cleavable by an enzyme or a chemical, wherein the linker sequence 
is positioned between the oil-body-surface-avoiding linker amino acid sequence 
and said sequence encoding the first recombinant polypeptide. 
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75. A recombinant multimeric-fusion-protein comprising (i) an oil-body- 
targeting-protein, or fragment thereof, (ii) a first recombinant polypeptide and a 
f ill > second recombinant polypeptide, wherein said first and second recombinant 
polypeptides are capable of forming a multimeric-protein-complex. 
5 76. The recombinant multimeric-fusion-protein of claim 75 wherein said 

oil-body-targeting-protein is selected from an oil-body-protein or an 
immunoglobulin. 

77. The recombinant multimeric-fusion-protein of claim 76 wherein said 

oil-body-protein is an oleosin or a caleosin. 
10 78. The recombinant multimeric-fusion-protein of claim 77, wherein said 

multimeric-fusion-protein is a heteromultimeric-fusion-protein. 

79. The recombinant heteromultimeric-fusion-protein of claim 78 wherein 

said first and second recombinant polypeptide form an enzymatically active 

heteromultimeric redox complex or an immunoglobulin. 
15 80. The recombinant fusion polypeptide of claim 79 wherein said first 

recombinant polypeptide is a thioredoxin and said second recombinant 

polypeptide is a thioredoxin-reductase. 

81. The recombinant fusion polypeptide of claim 80, wherein said 
thioredoxin is selected from the group consisting of SEQ ID NOs:38, 42, 46, 50 

20 and SEQ ID NOs:52-1 94. 

82. The recombinant fusion polypeptide of claim 80, wherein said 
thioredoxin-reductase is selected from the group consisting of those set forth in 
SEQ ID NOs:8, 9, 10, 40, 44, 48, 50 and SEQ ID NOs:195-313, 

83. The recombinant fusion polypeptide of claim 75, wherein positioned 
25 between said oil-body-targeting-protein and said first recombinant polypeptide is 

an oil-body-surface-avoiding linker amino acid sequence. 

84. The recombinant fusion polypeptide of claim 83, wherein said oil- 
body-surface-avoiding linker amino acid sequence is substantially negatively 
charged, or has a molecular weight of at least 35 kd. 

30 85. The recombinant fusion polypeptide of claim 84, wherein the fusion 

polypeptide further comprises a linker amino acid sequence that is specifically 
cleavable by an enzyme or a chemical, wherein the linker sequence is positioned 
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between the oil-body-surface-avoiding linker amino acid sequence and said first 
recombinant polypeptide. 

86. Isolated oil bodies comprising a multimeric-protein-complex 
comprising (i) an oil-body-targeting-protein and (ii) a first recombinant 

5 polypeptide, said oil bodies further comprising a second recombinant 

polypeptide, wherein said first and second recombinant polypeptide are capable 
of forming a multimeric-protein-complex. 

87. Isolated oil bodies of claim 86 wherein said oil-body-targeting-protein 
is an oil-body-protein or an immunoglobulin. 

10 88. Isolated oil bodies of claim 87 wherein said oil-body-protein is an 

oleosin or a caleosin. 

89. Isolated oil bodies of claim 88 wherein said first recombinant 
polypeptide is fused to said oleosin or caleosin. 

90. Isolated oil bodies of claim 86 wherein said first recombinant 
15 polypeptide is fused to said second recombinant polypeptide. 

91. The isolated oil bodies of claim 90, wherein said multimeric-protein- 
complex is a heteromultimeric-protein-complex. 

92. The isolated oil bodies of claim 91 wherein said heteromultimeric- 
protein-complex is an enzymatically active redox complex or an immunoglobulin. 

20 93, Isolated oil bodies comprising 

(a) a first fusion protein comprising a first oil-body-targeting-protein fused to a 
first recombinant polypeptide; and 

(b) a second fusion protein comprising a second oil-body-targeting-protein fused 
to a second recombinant polypeptide, 

25 wherein said first and second recombinant polypeptide are capable of forming a 
multimeric-protein-complex. 

94. Isolated oil bodies of claim 93 wherein said first oil-body-targeting- 
protein is an oil-body-protein or an immunoglobulin. 

95. Isolated oil bodies according claim 93 wherein said first oil-body- 
30 protein is an oleosin or a caleosin. 

96. The isolated oil bodies of claim 93, wherein said multimeric-protein- 
complex is a heteromultimeric-protein-complex. 
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97. Isolated oil bodies of claim 93 wherein said first and second 
recombinant polypeptide form an enzymatically active heteromultimeric redox 
complex or an immunoglobulin. 

98. Isolated oil bodies of claim 93 wherein said first recombinant 
5 polypeptide is a thioredoxin and said second recombinant polypeptide is a 

thioredoxin-reductase. 

99. The oil bodies of claim 98, wherein said thioredoxin is selected from 
the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID NOs:52-194. 

100. The oil bodies of claim 98, wherein said thioredoxin-reductase is 
10 selected from the group consisting of those set forth in SEQ ID N0s:8, 9, 10, 

40, 44, 48, 50 and SEQ ID NOs:195-313. 

101. The oil bodies of claim 93 wherein said first recombinant 
polypeptide is an immunoglobulin-polypeptide-chain. 

102. The oil bodies of claim 93 wherein said first recombinant 

15 polypeptide is an immunoglobulin light chain, or an immunologically active 

portion thereof, and said second recombinant polypeptide is an immunoglobulin 
heavy chain, or an immunologically active portion thereof. 

103. The oil bodies of claim 101 wherein said oil-body targeting-protein 
comprises protein A, protein L or protein G. 

20 104. A cell comprising oil bodies and (i) an oil-body-targeting-protein, (ii) 

a first recombinant polypeptide and (iii) a second recombinant polypeptide 
wherein 

(1) said first recombinant polypeptide is capable of associating with said oil- 
body-targeting-protein; and 
25 (2) said first recombinant polypeptide capable of associating with said second 
recombinant polypeptide to form a multimeric-protein-complex. 

105. The cell of claim 104 wherein said oil-body-targeting-protein is an 
oil-body-protein or an immunoglobulin. 

106. The cell of claim 105 wherein said oil-body-protein is an oleosin or 
30 caleosin. 
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107. The cell of claim 104 wherein said first recombinant polypeptide is 
fused to said second recombinant polypeptide so as to form a multimeric-fusion- 
protein. 

108. The cell of claim 107 wherein said multimeric-fusion-protein is a 
5 heteromultimeric-fusion-protein. 

109. The cell of claim 104 wherein said first recombinant polypeptide is 
fused to said oil-body-targeting-protein. 

110. The cell of claim 104 wherein said first recombinant polypeptide is 
fused to said first oil-body-targeting-protein and said second polypeptide is fused 

10 to a second oil-body-targeting-protein. 

111. The cell of claim 1 04 wherein said second recombinant polypeptide 
is capable of associating with a second oil-body-targeting-protein. 

112. The cell of claim 104 wherein said first and second recombinant 
polypeptide form a heteromultimeric-protein-complex. 

15 113. The cell of claim 104 wherein said heteromultimeric-protein- 

complex is an enzymatically active redox complex or an immunoglobulin. 

114. The cell of claim 104 wherein said first polypeptide is a thioredoxin 
and said second polypeptide is a thioredoxin-reductase. 

115. The cell of claim 1 14, wherein said thioredoxin is selected from the 
20 group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID NOs:52-194. 

116. The cell of claim 114, wherein said thioredoxin-reductase is 
selected from the group consisting of those set forth in SEQ ID N0s:8, 9, 10, 
40, 44, 48, 50 and SEQ ID NOs:195-313. 

117. The cell of claim 104 wherein said first recombinant polypeptide is 
25 an immunoglobulin-polypeptide-chain. 

118. The cell of claim 104 wherein said first recombinant polypeptide is 
an immunoglobulin light chain, or an immunologically active portion thereof, and 
said second recombinant polypeptide is an immunoglobulin heavy chain, or an 
immunologically active portion thereof. 

30 119. The cell of claim 117 wherein said oil-body targeting-protein 

comprises protein A, protein L or protein G. 

120. The cell of claim 104 wherein said cell is obtained from a plant. 
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121. The cell of claim 104 wherein said cell is obtainable from a 
safflower plant. 

122. A plant comprising cells of claim 104. 

123. A safflower plant comprising cells of claim 104. 

5 1 24. The method of claim 2 wherein said first recombinant polypeptide is 

a thioredoxin and said second recombinant polypeptide is a thioredoxin- 
reductase, said method further comprising (d) formulating the oil bodies for use 
in the preparation of a food product, personal care product or pharmaceutical 
composition. 

10 125. The method of claim 124, wherein said thioredoxin is selected from 

the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID NOs:52-194. 
1 26. The method of claim 1 24, wherein said thioredoxin-reductase is 

selected from the group consisting of those set forth in SEQ ID NOs:8, 9, 10, 

40, 44, 48, 50 and SEQ ID NOs:195-313. 
15 127. The method of claim 124 wherein said formulating comprises the 

addition of NADP or NADPH. 

128. The method of claim 124 wherein said food product is a milk or 
wheat based food product. 

129. The method of claim 124 wherein said personal care product 

20 reduces the oxidative stress to the surface area of the human body or is used to 
lighten the skin. 

1 30. The method of claim 1 24 wherein said pharmaceutical composition 
is used to treat chronic obstructive pulmonary disease (COPD), cataracts, 
diabetes, envenomation, bronchopulmonary disease, malignancies, psoriasis, 

25 reperfusion injury, wound healing, sepsis, Gl bleeding, intestinal bowel disease 
(IBD), ulcers, GERD (gastro esophageal reflux disease). 

131. A composition comprising isolated oil bodies, thioredoxin and 
thioredoxin-reductase. 

132. The composition of claim 131, wherein said thioredoxin is selected 
30 from the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID NOs:52- 

194. 
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133. The composition of claim 131 f wherein said thioredoxin-reductase 
is selected from the group consisting of those set forth in SEQ ID NOs:8, 9, 10, 
40, 44, 48, 50 and SEQ ID NOs:195-313. 

134. The composition of claim 131 further comprising NADP or NADPH. 
5 135. A food product, personal care product or pharmaceutical 

composition comprising the composition of claim 131. 

136. The food product, personal care product or pharmaceutical 
composition of claim 135 further comprising NADP or NADPH. 

137. The food product of claim 135 wherein said food product is a milk 
10 based or wheat based food product. 

138. The personal care product of claim 135 wherein said personal care 
product reduces the oxidative stress to the surface area of the human body or is 
used to lighten the skin. 

139. The pharmaceutical composition of claim 135 wherein said 
15 pharmaceutical composition is used to treat chronic obstructive pulmonary 

disease, cataracts, psoriasis or reperfusion injury. 

140. The multimeric-fusion-protein of claim 75, wherein said fusion- 
protein contains two or more polypeptide chains selected from the group of 
proteins set forth in Figure 5. 

20 141. A method of reducing allergenicity of a food comprising the steps 

of: 

providing the isolated oil bodies of claim 78; and 
adding the isolated oil bodies to the food, whereby allergenicity of 
the food is reduced. 

25 142. The method of claim 141, wherein the food is selected from the 

group consisting of wheat flour, wheat dough, milk, cheese, yogurt and ice 
cream. 

143. The method of claim 141, further comprising providing NADH as a 
co-factor in the substantial absence of NADPH. 
30 144. A method of treating or protecting a target against oxidative stress, 

comprising the steps of: 

providing the recombinant fusion polypeptide of claim 46; and 
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contacting the recombinant fusion polypeptide with a target, 
wherein the target is susceptible to oxidative stress, thereby treating or 
protecting against the stress. 

145. The method of claim 144, wherein the target is selected from the 
5 group consisting of a molecule, a molecular complex, a cell, a tissue, and an 

organ. 

146. A method for preparing an enzymatically active redox protein 
associated with oil bodies comprising: 

a) producing in a cell a redox fusion polypeptide comprising a first 
10 redox protein linked to a second redox protein; 

b) associating said redox fusion polypeptide with oil bodies through 
an oil-body-targeting-protein capable of associating with said redox fusion 
polypeptide and said oil bodies; and 

c) isolating said oil bodies associated with said redox fusion 
15 polypeptide. 

147. The method of claim 146 wherein said oil-body-targeting-protein is 
an oil-body-protein or an immunoglobulin. 

148. The method of claim 146 wherein said oil-body-protein is an oleosin 
or a caleosin. 

20 149. The method of claim 146 wherein said first redox protein is a 

thioredoxin and said second redox protein is a thioredoxin-reductase. 

150. The method of claim 149, wherein said thioredoxin is selected from 
the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID NOs:52-194. 

151. The method of claim 149, wherein said thioredoxin-reductase is 
25 selected from the group consisting of those set forth in SEQ ID N0s:8, 9, 10, 

40, 44, 48, 50 and SEQ ID NOs:195-313. 

1 52. The method of claim 146 wherein said cell is a plant cell. 

153. The method of claim 146 wherein said cell is a safflower cell. 

1 54. A method for preparing a redox protein associated with oil bodies 
30 comprising: 

a) introducing into a cell a chimeric nucleic acid sequence comprising: 
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1 ) a first nucleic acid sequence capable of regulating 
transcription in said cell operatively linked to; 

2) a second nucleic acid sequence encoding a recombinant 
fusion polypeptide comprising (i) a nucleic acid sequence encoding 

5 a sufficient portion of an oil-body-protein to provide targeting of 

said recombinant fusion polypeptide to an oil body linked to (ii) a 
nucleic acid sequence encoding a redox fusion polypeptide 
comprising a first redox protein linked to a second redox protein 
operatively linked to; 
10 3} a third nucleic acid sequence capable of terminating 

transcription in said cell; 

b) growing said cell under conditions to permit expression of said 
redox fusion polypeptide in a progeny cell comprising oil bodies; and 

c) isolating from said progeny cell said oil bodies comprising said 
15 redox fusion polypeptide. 

155. The method of claim 154, wherein positioned between said nucleic 
acid sequence encoding a sufficient portion of an oil-body-protein and said 
nucleic acid sequence encoding a redox fusion polypeptide is a linker nucleic 
acid sequence encoding an oil-body-surface-avoiding linker amino acid sequence. 
20 156. The method of claim 155, wherein said oil-body-surface-avoiding 

linker amino acid sequence is substantially negatively charged, or has a 
molecular weight of at least 35 kd. 

157. The method of claim 156, wherein the gene fusion further 
comprises a linker nucleic acid sequence encoding an amino acid sequence that 

25 is specifically cleavable by an enzyme or a chemical, wherein the linker sequence 
is positioned between the oil-body-surf ace-avoiding linker amino acid sequence 
and said nucleic acid sequence encoding a redox fusion polypeptide. 

158. The method of claim 157, further comprising introducing an 
enzyme or chemical that cleaves said redox fusion polypeptide from said oil 

30 body, thereby obtaining isolated redox fusion polypeptide. 

159. The method of claim 154 wherein said oil-body-protein is an oleosin 
or a caleosin. 
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160, The method of claim 154 wherein said first redox protein is a 
thioredoxin and said second redox protein is a thioredoxin-reductase. 

161. The method of claim 1 60, wherein said thioredoxin is selected from 
the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID N0s:52-194. 

5 162. The method of claim 160, wherein said thioredoxin-reductase is 

selected from the group consisting of those set forth in SEQ ID NOs:8, 9, 10, 
40, 44, 48, 50 and SEQ ID N0s:195-313. 

163. The method of claim 154 wherein said cell is a plant cell. 

1 64. The method of claim 1 54 wherein said thioredoxin and thioredoxin- 
10 reductase is obtained from Arabidopsis. 

165. The method of claim 146 wherein the first redox protein is at least 
5 times more active when produced as a redox fusion polypeptide as compared 
to the production of the first redox protein without the second redox protein. 

166. The method of claim 146 further comprising: 

15 d) formulating an emulsion of the oil bodies associated with the redox 

fusion polypeptide for use in the preparation of a product capable of treating 
oxidative stress in a target, a product capable of chemically reducing a target, 
pharmaceutical composition, a personal care product or a food product. 

167. A chimeric nucleic acid comprising: 

20 1 ) a first nucleic acid sequence capable of regulating transcription in 

a host cell operatively linked to; 

2) a second nucleic acid sequence encoding a recombinant fusion 
polypeptide comprising (i) a nucleic acid sequence encoding a sufficient portion 
of an oil-body-protein to provide targeting of said recombinant fusion polypeptide 

25 to an oil body linked to (ii) a nucleic acid sequence encoding a redox fusion 
polypeptide comprising a first redox protein linked to a second redox protein 
operatively linked to; 

3) a third nucleic acid sequence capable of terminating transcription 
in said cell. 

30 168. The chimeric nucleic acid of claim 167 wherein said oil-body-protein 

is an oleosin or a caleosin. 
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169. The chimeric nucleic acid of claim 167 wherein said first redox 
protein is a thioredoxin and said second redox protein is a thioredoxin-reductase. 

170. The chimeric nucleic acid of claim 169, wherein said thioredoxin is 
selected from the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID 

5 NOs:52-194. 

171. The chimeric nucleic acid of claim 169, wherein said thioredoxin- 
reductase is selected from the group consisting of those set forth in SEQ ID 
NOs:8, 9, 10, 40, 44, 48, 50 and SEQ ID NOs:195-313. 

172. The chimeric nucleic acid of claim 167 wherein said cell is a plant 

10 cell. 

173. The chimeric nucleic acid of claim 167, wherein positioned between 
said nucleic acid sequence encoding a sufficient portion of an oil-body-protein 
and said nucleic acid sequence encoding a redox fusion polypeptide is a linker 
nucleic acid sequence encoding an oil-body-surface-avoiding linker amino acid 

1 5 sequence. 

174. The chimeric nucleic acid of claim 173, wherein said oil-body- 
surface-avoiding linker amino acid sequence is substantially negatively charged, 
or has a molecular weight of at least 35 kd. 

175. The chimeric nucleic acid of claim 174, wherein the gene fusion 
20 further comprises a linker nucleic acid sequence encoding an amino acid 

sequence that is specifically cleavable by an enzyme or a chemical, wherein the 
linker sequence is positioned between the oil-body-surface-avoiding linker amino 
acid sequence and said nucleic acid sequence encoding a redox fusion 
polypeptide. 

25 176. A transgenenic plant comprising the chimeric nucleic acid sequence 

of claim 167. 

177. The transgenic plant of claim 176, wherein said chimeric nucleic 
acid is contained within a piastid. 

178. A safflower plant comprising the chimeric nucleic acid of anyone of 
30 claim 167. 

179. The safflower plant of claim 178, wherein said chimeric nucleic 
acid is contained within a piastid. 
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180. A plant seed comprising the chimeric nucleic acid of claim 167. 

181. The plant seed of claim 180, wherein said chimeric nucleic acid is 
contained within a plastid. 

182. A safflower seed comprising the chimeric nucleic acid of claim 168. 
5 183. The safflower seed of claim 182, wherein said chimeric nucleic acid 

is contained within a plastid. 

184. An oil body preparation obtained by the method of claim 146. 

185. A food product comprising an oil body preparation of claim 184. 

186. A composition comprising an oil body preparation of claim 184. 

10 187. A personal care product comprising an oil body preparation of claim 

184. 

188. A product capable of treating oxidative stress in a target comprising 
an oil body preparation of claim 1 84. 

189. A product capable of chemically reducing a target comprising an oil 
1 5 body preparation of claim 1 84. 

190. A detergent composition comprising the product of claim 184. 

191. A method of cleansing an item, comprising administering the 
product of claim 189 to said item under conditions that promote cleansing, 

192. An emulsion formulation prepared by the method of claim 166. 
20 193- A nucleic acid construct comprising a gene fusion, wherein the 

gene fusion comprises a first region encoding an oil-body-protein or an active 
fragment thereof, operably linked to a second region encoding at least one 
thioredoxin-related protein or an active fragment thereof. 

194. The construct of claim 193, wherein the at least one thioredoxin- 
25 related protein is thioredoxin. 

195. The construct of claim 194, wherein said thioredoxin is selected 
from the group consisting of SEQ ID NOs:38 f 42, 46, 50 and SEQ ID NOs:52- 
194. 

1 96. The construct of claim 194, wherein the thioredoxin is derived from 
30 Arabidopsis or wheat. 

197. The construct of claim 193, wherein the at least one thioredoxin- 
related protein is thioredoxin-reductase. 
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198. The construct of claim 197, wherein said thioredoxin-reductase is 
selected from the group consisting of those set forth in SEQ ID NOs:8, 9, 10, 
40, 44, 48, 50 and SEQ ID NOs: 195-3 13. 

199. The construct of claim 197, wherein the thioredoxin-reductase is 
5 derived from Arabidopsis or wheat. 

200. The construct of claim 1 97, wherein the thioredoxin-reductase is an 
NADPH-dependent thioredoxin-reductase. 

201. The construct of claim 193, wherein the second region encodes a 
thioredoxin and thioredoxin-reductase. 

10 202. The construct of claim 201, wherein the thioredoxin and 

thioredoxin-reductase is obtained from Mycobacterium leprae. 

203. The construct of claim 201, wherein at least one thioredoxin-related 
protein is an engineered fusion protein. 

204. The construct of claim 1 93, wherein the first region precedes, in a 
15 5 ' to 3 ' direction, the second region. 

205. The construct of claim 193, wherein the first region follows, in a 
5 ' to 3 ' direction, the second region. 

206. The construct of claim 193, wherein the gene fusion further 
comprises a third region encoding a second thioredoxin-related protein or an 

20. active fragment thereof, operably linked to the first region, or to the second 
region, or to both. 

207. The construct of claim 193, further comprising a seed-specific 
promoter operably linked to the gene fusion. 

208. The construct of claim 207, wherein the promoter is a phaseolin 
25 promoter. 

209. The construct of claim 193, wherein at least one thioredoxin- 
related protein is derived from a plant species selected from the group consisting 
of Arabidopsis and wheat. 

210. The construct of claim 193, wherein at least one thioreoxin-related 
30 protein is derived from £ coli. 
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21 1. The construct of claim 193 further comprising a nucleic acid 
effective as a termination region in plant cells, operably linked to the gene 
fusion. 

212. The construct of claim 193, wherein the gene fusion further 

5 comprises a nucleic acid sequence encoding an oil-body-surface-avoiding linker 
amino acid sequence, wherein the linker amino acid sequence is positioned 
between the first region and the second region. 

213. The construct of claim 212, wherein said oil-body-surface-avoiding 
linker amino acid sequence is substantially negatively charged, or has a 

10 molecular weight of at least 35 kd. 

214. The construct of claim 213, wherein the gene fusion further 
comprises a linker nucleic acid sequence encoding an amino acid sequence that 
is specifically cleavable by an enzyme or a chemical, wherein the linker sequence 
is positioned between the oil-body-surface-avoiding linker amino acid sequence 

15 and the second region. 

215. The construct of claim 193, wherein a region of the gene fusion 
comprises a plurality of codons, each codon specifying a single ammo acid, 
wherein at least one of the codons is modified from a naturally occurring codon 
within the region, 

20 216. The construct of claim 215, wherein the modified codon specifies 

the same amino acid as the naturally occurring codon, and wherein the modified 
codon is modified according to a codon preference of a plant. 

217. The construct of claim 215, wherein the modified codon specifies 
an amino acid that is different from the amino acid specified by the naturally 

25 occurring codon. 

218. A transgenic plant containing a nucleic acid construct comprising a 
gene fusion, wherein the gene fusion comprises a region encoding an oil-body- 
protein or an active fragment thereof, operably linked to a region encoding a first 
thioredoxin-related protein or an active fragment thereof. 

30 219. The plant of claim 218, wherein the thioredoxin-related protein is 

thioredoxin. 
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220, The plant of claim 219, wherein said thioredoxin is selected from 
the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID NOs:52-194. 

221. The plant of claim 219, wherein the thioredoxin is derived from 
Arabidopsis or wheat. 

5 222. The plant of claim 218, wherein the thioredoxin-related protein is 

thioredoxin-reductase. 

223. The plant of claim 222, wherein said thioredoxin-reductase is 
selected from the group consisting of those set forth in SEQ ID N0s:8, 9, 10, 
40, 44, 48, 50 and SEQ ID NOs:195-313. 
10 224. The plant of claim 222, wherein the thioredoxin-reductase is an 

NADPH-dependent thioredoxin-reductase. 

225. The plant of claim 218, wherein the construct is contained within a 

plastid. 

226. The plant of claim 218, wherein the first thioredoxin-related protein 
15 is thioredoxin and wherein said construct further comprises a region encoding a 

thioredoxin-reductase. 

227. The plant of claim 226, wherein the thioredoxin and thioredoxin- 
reductase is obtained from Mycobacterium leprae. 

228. The plant of claim 226, wherein the thioredoxin-related protein is 
20 an engineered fusion protein. 

229. The plant of claim 218, wherein the first region precedes, in a 5' to 
3' direction, the second region. 

230. The plant of claim 218, wherein the first region follows, in a 5' to 
3' direction, the second region. 

25 231 . The plant of claim 218, wherein the gene fusion further comprises 

a third region encoding a second thioredoxin-related protein or an active 
fragment thereof, operably linked to the first region, or to the second region, or 
to both. 

232. The plant of claim 218, further comprising a seed-specific promoter 
30 operably linked to the gene fusion. 

233. The plant of claim 232, wherein the promoter is a phaseolin 
promoter. 
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234. The plant of claim 218, wherein the thioredoxin-related protein is 
derived from a plant species selected from the group consisting of Arabidopsis 
and wheat. 

235. The plant of claim 218, wherein the thioredoxin-related protein is 
5 derived from £ coli. 

236. The plant of claim 218 further comprising a nucleic acid effective 
as a termination region in plant cells, operably linked to the gene fusion. 

237. The plant of claim 218, wherein the gene fusion further comprises 
a nucleic acid sequence encoding an oil-body-surface-avoiding linker amino acid 

10 sequence, wherein the nucleic acid encoding the linker amino acid sequence is 
positioned between the region encoding an oil-body-protein and the region 
encoding a first thioredoxin-related protein. 

238. The plant of claim 237, wherein said oil-body-surface-avoiding 
linker amino acid sequence is substantially negatively charged, or has a 

15 molecular weight of at least 35 kd. 

239. The plant of claim 238, wherein the gene fusion further comprises 
a linker nucleic acid sequence encoding an amino acid sequence that is 
specifically cleavabJe by an enzyme or a chemical, wherein the linker sequence is 
positioned between the oil-body-surface-avoiding linker amino acid sequence and 

20 the region encoding a first thioredoxin-related protein. 

240. The plant of claim 218, wherein a region of the gene fusion 
comprises a plurality of codons, each codon specifying a single amino acid, 
wherein at least one of the codons is modified from a naturally occurring codon 
within the region. 

25 241. The plant of claim 240, wherein the modified codon specifies the ■ 

same amino acid as the naturally occurring codon, and wherein the codon is 
modified according to a codon preference of a plant. 

242. The plant of claim 240, wherein the modified codon specifies an 
amino acid that is different from the amino acid specified by the naturally 

30 occurring codon. 
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243. The plant of claim 218, wherein the plant is selected from the 
group consisting of Arabidopsis and safflower. 

244. A transgenic plant comprising a nucleic acid construct a seed- 
specific promoter operably linked to a gene fusion, wherein the gene fusion 

5 comprises a region encoding an oil-body-protein or an active fragment thereof, 
operably linked to a region encoding a first thioredoxin-related protein or an 
active fragment thereof, wherein a fusion protein comprising activities of oleosin 
and the thioredoxin-related protein is produced in a seed of the plant. 

245. The transgenic plant of claim 244, wherein the plant is selected 
10 from the group consisting of Arabidopsis and safflower. 

246. The transgenic plant of claim 244 wherein the promoter is a 
phaseolin promoter. 

247. The seed of the plant of claim 244. 

248. The seed of claim 247, comprising a thioredoxin-related protein in a 
15 concentration of at least about 0.5% of total cellular seed protein. 

249. An extract of the seed of claim 247, wherein the extract comprises 
an activity of a thioredoxin-related protein. 

250. An oil body from the seed of claim 247. 

251 . Oil produced from the seed of claim 247. 

20 252. A method of making a fusion protein comprising a thioredoxin- 

related activity, the method comprising the steps of: 

providing a transgenic plant comprising a nucleic acid construct 

comprising a seed-specific promoter operably linked to a gene 
fusion, wherein the gene fusion comprises a region encoding an 
25 oil-body-protein or an active fragment thereof, operably linked to a 

region encoding a first thioredoxin-related protein or an active 
fragment thereof, the gene fusion encoding a fusion protein 
comprising a thioredoxin-related activity; 
obtaining seeds from the plant; and 
30 recovering the fusion protein by isolating oil bodies from the seeds. 

253. The method of claim 252, further comprising the step of 
fractionating the oil bodies to achieve partial purification of the fusion protein. 
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254. Oil bodies in association with a fusion protein, obtained by the 
method of claim 252. 

255. The method of claim 252 further comprising a step of cleaving the 
oil-body-protein from the thioredoxin-related protein after fractionation of the oil 

5 bodies. 

256. The method of claim 255, wherein the cleaving step comprises use 
of a protease. 

257. The method of claim 255, wherein the cleaving step comprises 
chemical proteolysis. 

10 258. A method of reducing allergenicity of a food comprising the steps 

of: 

providing a preparation comprising oil bodies associated with a 
fusion protein, the fusion protein comprising an oil-body-protein or an 
active fragment thereof and a thioredoxin-related protein or an active 
15 fragment thereof; and 

adding the preparation to the food, whereby allergenicity of the 
food is reduced due to activity of the thioredoxin-related protein or 
fragment. 

259. The method of claim 258, wherein the food is selected from the 
20 group consisting of wheat flour, wheat dough, milk, cheese, yogurt and ice 

cream. 

260. The method of claim 258, further comprising providing NADH as a 
co-factor in the substantial absence of NADPH. 

261. A composition comprising a fusion protein, the fusion protein 

25 comprising an oil-body-protein or an active fragment thereof and a thioredoxin- 
related protein or an active fragment thereof, in a pharmaceutical^ acceptable 
carrier. 

262. The composition of claim 261, further comprising oil bodies in 
association with the fusion protein. 

30 263. A cosmetic formulation comprising oil bodies associated with a 

fusion protein, the fusion protein comprising an oil-body-protein or an active 
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fragment thereof and a thioredoxin-related protein or an active fragment thereof, 
in a pharmaceutically acceptable carrier. 

264. A method of treating or protecting a target against oxidative stress, 
comprising the steps of: 

5 providing a preparation comprising a fusion protein, the fusion 

protein comprising an oil-body-protein or an active fragment thereof and a 
thioredoxin-related protein or an active fragment thereof; and 

contacting the preparation with a target, wherein the target is 
susceptible to oxidative stress, thereby treating or protecting against the 
10 stress. 

265. The method of claim 264, wherein the target is selected from the 
group consisting of a molecule, a molecular complex, a cell, a tissue, and an 
organ. 

266. A nucleic acid construct comprising a gene fusion, wherein the 
15 gene fusion comprises a first region encoding an oil-body-protein or an active 

fragment thereof, operably linked to a second region encoding at least one 
polypeptide or an active fragment thereof, and an oil-body-surface-avoiding linker 
in frame between the first and second region polypeptides. 



WO 02/50289 



1/8 



PCT/US01/50240 



TO 

ATTHtRSDQ 



. TO ■ 
ATTHiREDB 



TR 

ATTWREDB 



fr^^Ci^ ^^gg^-Al AlaC Aide c feSSL 



a— 



TO 

ATTHREDB \ 



ATTHtREDa I 



JW SO 330 



«0 

dc T 

JT C 



«0 ^ 



TO 

ATTHIREOB 



Sn— bees ^j^emsa^Ksaaie^ 



TO 

ATTHIRBQB 



TR , I 
ATTHlREOa | 



TO 

ATTHtREQQ 



TOO BOO 810 020 030 040 



TO 

ATTHlREOa 



iirw no** . mo 



FIGURE 1 



WO 02/50289 



2/8 



PCT/US01/50240 



no/ATTbUREDB 
TranstaBonafTR 



Ili?^o5ll 



Translation ofATTHlREDB 
Translation of TR 



Translation c/ATTHRBDB I 
Translation of TR \ 


130 HO tsa 




170 




TranstoOonotATTHReoe 1 
Translation of TR f 


no 200 210 


220 


230 


240 












799 TOO 270 


280 






Translation at ATThBREDB I 
Translation of TR \ 








W 



Translation of ATTHMEQ8 



FIGURE 2 



WO 02/50289 PCTVUSO 1/50240 

3/8 



M.tep TR/Trxh 
Arab TR-Unk-Trxh 



I I £p|ArtOE T I H BfV M 
G L E rTjHUfjT - - - Rgklc 




MJep TR/Trxh 
Arab TR-Unk-Trxh 



R N 

P. I 



at 



mi 



100 

Tj^jT k[#|d P 



no ^ no 

MJepTR/Trxh R Q P I K sfylvf^U E 5 [5^X1 X Q^^P^jp^ 

Arab TR-Unk-Trxh s s K P F P£]»jl> i : a| T l[X]2^H£^ 



130 ' 140 
M|£>I|s v[r|yP]q i pFg]e ^m. - - - L L[G£ 
" ^vjfjv A[K|R[lj s p yg] s gmh sogf w|5.1 



Arab TR-Unk-Trxh 



i tKBtimmmmmBBSii 



A R SplT 

Q 3 jc^Yhrvr^ 



M.lep TRTTrxh 
Arab TR-Unk-Trxh 



210 



IK FmTfJNJH 



35 U 



,LgEM a y N| 



X I ILYIt 



- A Y^D G B R D y|fc|g Jtffl8jCfcXT 



I Tf; 
V vt 



M.fc/> TR/Trxb 
Arab TR-Unk-Trxh 



asm 1 4 hi i b xteiif 

fSAlaK F[jjD G Gt^^L^Si 



Arab TR-Unk-Trxh 



mil 



JJ0 



MJep TRTTrxh 
Arab TR-Unk-Trxh 



370 380 390 

J iS n Atojy tS3e y t d a s f f ApiyfEls snxp 



bL£3g gyilCE e fceja v^a c h t y e t w N^a^JS kanbsktl 



MJep TR/Trxh - 
Arab TR-Unk-Trxh 



lymiT n p E^^RfEiF sply 



450 

p[M P L 



M.lep TR/Trxh 
Arab TR-Unk-Trxh 



fk\ B \&M I I L | D | 1C. V flSStfrAttKf - |K| 0 E^Q S T 



i^o VfV^P N L N 



FIGURE 3 



WO 02/50289 



PCT/US01/50240 



4/8 




FIGURE 4 



WO 02/50289 



5/8 



PCT7US01/50240 



HETEROMULTIMERS 



Class 


Heteromultimer 


Examnle seauence reference for 
heteromnltimeric subunits 


Biosvnthetic 


3-iriethvI-2-oxobutanoate 
dehydrogenase (2-oxoisovalerate 
dehydrogenase (lipoamide))- El 
component) 


McFCean (>t nl Rinrhfm Rinrihv<5 Arta ftQQ7\ 

1171:109-112 /Chnang,LL., e/aL FEBS Lett a 
(1990) 262 (2), 305-309. 


Biosynthetic 


3-oxoadipate CoA-transferase 


Parales, RJE. and Harwood, S.C. J. Bacteriol. (1992) 
174-4657-4666 


Biosynthetic ' 


anthranilate synthase:indole-3-grycerol 
phosphate synthase 


Zalkin, H.; ef a/. J. BioL Chem. (1984) 259:3985- 
3992. 


B ios vntli etic 


ripta-lcetoaevl-Taev] carrier nroteinl 

synthase I 


Oi ggtuu U-/VuUclovII, 1V1. c* at. JriUC, nail. /VCaU. oCI. 

U.S.A.J1991) 88:41144118 


R ifisvntfi ptir* 


Kiitvmtp— ncptnnrptafp CnA-francfpTncp 


r ibcuer,xv. j ez at. j « r> acteno L ^ l y y d ) l / j , l 


R irv<3vn fli pfi c 


r Aft/TP Hpnpnrlpnf rrmfpin Vvna<!p 


Mllfypl 1? Pro/* "Wa+l Ar^d <vr>t If Q A flGB7\ 
iviULZCA, In. ef ui. rTOC IN all. /VCaO. oCI. <J.o*/V. ^1^0/1 

84:6-107 BurHE., era/. Gene (1991) 102 (1), 57- 
65. 


Biosynthetic 


carbarnoylrphosphate synthase 


Shigenobu,S., et al Nature. (2000) 407 (6800), 81- 
86. 


Rifvrvnfh ftti p 


Orpntinp VinaQp 


oiuauciio, j, j., €i at. jjiociiciiL jDiopnys. Jtves. 
Commun. (1986) 138:392-398. / Roman, D.; et al . 

Prne Natl AnaH ^ri TT ^ A M0RS^R9-R^04 8^08 


Biosynthetic 


gamma-glutamyltransferase (gamma- 
p'hitamvl tran«*nentida*?p.^ 

glUUUU Y 1 UOUOpvUUUiUV 1 


Papandrikopoulou, A.; et al Eur. J. Biochem. 


Biosynthetic 


glutathione transferase 


Morrow, C.S. al. Gene (1989) 75:3-1 1 


R i o<3 vn tYi pfi p 


frlv/^pimlw^-rifincnViJitp rlphvrfmtTpnacp 


ouic, o. i. ei at. j, oacicnoi. ^lS'oo^ i /vi.z***to-A*tjO. 


B ios vntti ptit* 


tntyinvlatp' f*vp1a<:p 




Biosynthetic 


heterodisulfide reductase 


SminvD.K, er a/. L Bacteriol. (1997) 179 (22), 
7135-7155. 


Biosynthetic . 


human camepsin 


Ritonja, A. e/a/. FEBS Lett (1988) 228:341-345. 


Biosynthetic 


Hydrogenase 


Menon, NJKL etal J. Bacteriol. (1990) 172:1969- 

1077 


Biosynthetic 


Meprin A 


Johnson, G.D. and Hersh, L.B. J. Biol. Chem. 
(1992)267:13505-13512. 


Biosynthetic 


methionine adenosyltransferase 


Horikawa, S.; Tsukada, K. FEBS Lett (1992) 
311-37-41 


Biosynthetic 


methylmalonyl-CoA mutase 


Jackson, CA. e/fl/. Gene (1995) 167:127-132. 


B iown tfiefi f* 

uxxjoy ix Uilv Hi/ 


■mitnpTirtnri'rial "nmppecino npnfiHncp. 




Biosynthetic 


Na+/K+-exchanging ATPase 


Shull,G.E., a/. Biochemistry (1986) 25 (25), 

MoL Cell. BioL (1986) 6 (1 1), 3884-38907 
MercerJR-W., <?/ ^z/. J. Cell Biol. (1993) 121 (3), 
579-586. 


Biosynthetic 


NAJD(+)-depenrient isocitrate 
dehydrogenase 


Cupp, J.R. and McAlister-Henn, L. J. Biol. Chem. 
(1992) 267:16417-16423. /Cupp, J.R. and 
McAlister-Henn, L. 

J. Biol. Chem. (1991) 266:22199-22205. 


Biosynthetic 


phosphonTjosylfonnylglycmamidine 
synthase 


Ebbole, D.J.; Zalkin, H. J. Biol. Chem (1987) 
262:8274-8287. 


Biosynthetic 


protocatechuate 3,4-dioxygenase 


Frazee, R.W.; etal J. BacterioL (1993) 175:6194- 
6202. * 


Biosynthetic 


S-100 protein 


Engelkamp, D.; etal Biochemistry < 1992) 



FIGURE 5 



WO 02/50289 



6/8 



PCT7US01/50240 







J l . 1 uzjo-iUzM. / Auore, K.J.; et al. J. Biol. Chem. 
(1990)265:15537-15543. 


uiosyiiuieuc 


sucrose— xructan o-iruciosyirransierase 


aprenger, IN.; etal. woe. JNatl. Acad. bci. U.S.A. 
(1995)92:11652-11656. 


jDiosyniiietic 


ouperoxiae aismuiase 


uapo, U.K.; e/ -oiocnem. iJiopnys. Res. 
Cornmun. (1990) 173:1186-1193. 


Biosyntiietic 


Urease 


Labigne, A.; etal J. BacterioL (1991) 173:1920- 

1 Q1 1 


Biosynthetic 


urokinase-type plasminogen activator 
(urokinase) 


Belin, D. et al Eur. J. Biochem. (1985) 148:225- 
232. 


— : 

Biosythetic 


methylrnalonyl-coenzyme A mutase 


BuxlvA., <?/ J. Bactenol. (1993) 175 (11), 3511- 
3519. 


Calcium binding 


Calcineurin 


Muramatsu,T. and Kincaid,R.L. Biochirn. Biophys. 
Acta (1993) 1 178 (1), 1 17-120 / Guerini, D. et al 
DNA (1989) 8:675-682. 


Calcium binding 


Calgranulin 


Imamichi, T. et al Biochem. Biophys. Res. 
Commun. (1993) 194:819-825.. 


Calcium binding 


Calpain 


Aoki, K. et al FEBS Lett. (1986) 205:313-317. 


DNA binding 


API 


van Straaten,F., et al. Proceedings of the National 
Academy of Sciences of the United States of 
America.' (1983) 80 (11), 3183-3187. /Hattori,K., et 
al Proceedings of the National Academy of Sciences 
of the United States of America. (1988) 85 (23), 
9148-9152. 


DNA binding 


cMyc-Max 


Schreiber-Agus,N et al Mol. Cell. Biol. (1993) 13 
(5), 2765-2775. 


DNA binding 


DNA binding protein HU-l/HU-2 


Laine, B. etal Eur. J. Biochem. (1980) 103:447- 
461. 


DNA binding 


hepatic nuclear factor 1 


Bach,L et al Nucleic Acids Res. (1992) 20 (16), 
4199-4204. / Rey-Campos,J. etal EMBO J. (1991) 
10(6), 1445-1457. 


DNA binding . 


Integration host factor 


Miller,H.I. Cold Spring Harbor symposia on 
quantitative biology. (1984) 49, 691-698. / Flarnrn, 
E. and Weisberg, RA. J. Mol. BioL (1985) 183:117- 
128. 


DNA binding 


Ku 


Reeves, W.H. and Sthoeger,Z,M. J. BioL Chem. 

(1989)264(9), 5047-5052./ 

J. BioL Chem. (1989) 264 (23), 13407-1341 1. . 


DNA binding 


MutS 


Bockerera/. 1999. Cancer Research 59, 816-822. 


DNA binding 


NF-E2 


Chan, J. Y. et al. Proc. NatL Acad. Sci. U.S.A. 
(1993) 90 (23), 11366-113707 Tofa,T., et al. 
Oncogene (1997) 14 (16), 1901-1910. 


TYH.T A 1_ " I • 

DNA binding 


nuclear factor kB (NFkB) 


Kieran M, et al Cell. (1990) Sep 7;62(5):1007-18. / 
Ruben SM, et al Science (1991) Mar 
22;2d 1(5000): 1490-3. 

Erratum in: Science (1991) Oct 4^54(5028): 11 


Electron transport 


conuioid/iron-sulfur protein 


Lu, WJP. etal L Biol. Chem. (1993) 268:5605- 
jom. 


Electron transport 


cytochrome d ubiquinol oxidase 


Green, G.N. etal J. Biol. Chem. (1988) 263:13138- 
13143. 


Electron transport 


cytochrome-c3 hydrogenase 


Menon, NJL et al J. BacterioL (1987) 169:5401- 
5407. 


Electron transport 


electron transfer flavoprotein 


Finocchiaro, G. et al Biol. Chem. (1988) 
263:15773-15780. / Finocchiaro, G. et al Eur. X 
Biochem. (1993) 213:1003-1008. 



WO 02/50289 



7/8 



PCT/US01/50240 



Electron transport 


xylene monooxygenase 


Shaw, J.P. and Harayama, S. Eur. J. Biochem. 
yiyyjL) ZU7.J i-oi. / Jvasai, i ei at. J. oacteriOL 
(2001) 183 (22), 6662-6666. 


Growth factor 


heoatorvte PTOwth factor 


iNd&aniura, i, ei ai* in autre {iyoy/ ^*rZ.*f*m-443. 


Growth factor 


human chorionic gonadotropin 


MorganJFJ. ef a/. J. Biol. Chem. (1975) 250 (13), 
5247-5258. 


vjiuwui laULUi 


r uUClCl- UCilYCU givj W Ul IdAAOl 


l aKimoto, x er a/. Hirosnirna J. Med, ocl (1993) 
42 (1), 47-527 Josephs,S.F., e/ al Science (1984) 
225 (4662), 636-639. 


Hormone 


Bombyxin 


Adachi, T. et al J. BioL Chem. (1989) 264:7681- , 
7685. 


iiormone 


Follicle stimulating hormone 


riQdeSjJ.C and Goodrnan,H.M. J. Mol. AppL 
Genet (1981) 1 (1), 3-18. / Watkins,P.C., etal 
DNA (1987) 6 (3), 205-212. 


Hormone 


Insulin 


Bell,G.L, Pictet,R.L. 3 Rutter,W. J., Cordelia, 
Tischer,E. and Goodman,H.M. 
Sequence of the human insvdin gene. Nature. 284 
(5751), 26-32 (1980) 


Hormone 


Luteinizing Hormone 


Fiddes,J.C. and Goodman,H.M. J. Mol. AppL 
Genet. (1981) 1 (1), 3-18. / ShomeJB. and 
Fariow,A.F. J. Clin. Endocrinol. Metab. (1973) 36 
(3), 618-621. 


Hormone 


Thyroid stimulating hormone 


Fiddes,J.C. and Goodman,H.M. J. Mol. AppL 
Uenet (1981) 1 (1), 3-18. / Hayashizaki Y, et al 
FEBS Lett (1985) 188 (2), 394-400. 


irnrnunc 


B-cell antigen receptor complex 


Hasnunoto,S. etal J. ImmunoL (1993) 150 (2), 491- 
498. / Haswinkel,H. and RethJM. 
Immunogenetics (1992) 36 (4), 266-269. 


immune 


oeu surrace l>l/o moiecuies 


ureta-VidaljA., et aL Immunogenetics (1999) 49 
(7-8), 718-721. 


Immune 


human complement subcomponent Clq 


Sellar, G.C. etal Biochem. J. (1991) 274:481-490. 


immune 


T cell receptor 


laiken^BX. et al bcand. J. ImmunoL (200 1) 54 (l- 


jrnoiosyninesis 


^-pnycocyanin 


Utiner, vj.U, e/ al J. J3ioL Chem. (1981) 256:12167- 
12175. / Troxler, R.F. et al J. BioL Chem. (1981) 

ZJO.XZX /O-lZlOH. 


Photosynthesis 


■ferroredoxin-thioredoxin reductase 


Chow, L.P. al Eur.J. Biochem. (1995) 231:149- 
156. / Iwadate, H. et al Eur. J. Biochem. (1994) 

111 '4^-2171 


Photosynthesis 


Light harvesting complex I 


Proc. Natl. Acad. Sci. U.S.A. (1984) 81, 189-192. 


Photosynthetic 


cytochrome b559 


Carrillo, N. et al Curr Genet 1986;10(8):619-24. 


Protease 


ATP-dependent Clp protease 


Gerth, U. et al Gene (1996) 181:77-83. / Kunst^F. 
et aL iNature ( i yy /) jyu (00j / /, z4y-zDo. 


Receptor 


alpha-2-macroglobulin receptor 


Stricldand, D.K. al J. BioL Oiem. (1990) . 
265:1740L-17404. / Strickland, D.K. et al J. BioL ' 


Receptor 


Interleukin-2 receptor 


Ishida. N etal Nucleic Acids Res (\ 985^ 13 '7579- 
7589./ Hatakeyama,M.e/a/. Science (1989) 
244:551-556/ Tafceshita, T. al Science (1992) 
257:379-382. 


Receptor 


platelet-derived growth factor receptor 


Lee, K.H. et al Mol. Cell. BioL (1990) 10:2237- 
2246. / Herren,B. et al Bic^him. Biophys. Acta 
1173 (3), 294-302 (1993). 


Structural 


Hemoglobin 


HeindelULC. et al CeU (1978) 15 (1), 43-54. / 



WO 02/50289 



8/8 



PCT/US01/50240 







Best,J.5. et al koppe-Seyler's L. Physiol. Chem. 
(1989) 350 (5), 563-580. /Hardison,R.C. J. Biol. 
Chem. (1981) 256(22), 11780-11786. 


Structural 


human platelet glycoproteni lb 


Wenger,R.H. et al Biochem. Biophys. Res. 
Commun. (1988) 156 (1), 389-395. / YagUl etal 
J. Biol. Chem. (1994) 269 (26), 17424-17427. \ 


Structural 


Plasma fibronectin 


Kornblihtt, A.R. et ah Proc. Natl Acad. Sci. U.S.A. 
(1983) 80:3218-3222. 


Structural 


Spectrin 


Sahr, K.E. et al J. Biol. Chem. (1990) 265:4434- 
4443. / Winkelmann, J.C. et al J. Biol. Chem. 
(1990)265:11827-11832. 


Structural 


Tubulin 


Ponstingl, H. et al Proc. NatL Acad. Sci. U.S.A. 
(1981) 78:2757-2761. / Kiauhs, E. et al Proc. Natl. 
Acad. Sci. U.S.A. (1981) 78:4156-4160. 


Toxin 


Agkisacutacin 


Cheng^C et al Biochem. Biophys. Res. Commun. 
(1999) 265 (2), 530-535. 


Toxin 


Beta bungarotoxins 


Kondo, K. etal J. Biochem. (1978) 83:101-115. 


Toxin 


Crotoxin 


Bouchier,C. etal Nucleic Acids Res. (1988) 16 
(18), 9050. 


Toxin 


Mojave toxin 


John, T.R. et al Gene (1994) 139:229-234. 


Toxin 


venom protein C9S3 


, Rowan, E.G. et al Nucleic Acids Res. (1990) 
18:1639. / Joubert, FJ. and Viljoen, C.C. Hoppe- 
beylers Z. Physiol. Chem. (1979) 360:1075-1090. 


Miscellaneous 


Inhibin 


Forage, R.G. et al Proc. Natl. Acad. Sci. U.S.A. 
(1986) 83:3091-3095. 


Miscellaneous 


Monellin 


Frank, G. and Zuber, H. Hoppe-Seyler's Z. PhysioL 
Chem. (1976) 357:585-592. 


Miscellaneous 


mRNA capping enzyme 


Niles, E.G. etal, J. Virology (1986) 153:96-112. 


Miscellaneous 


Soybean insulin-binding pTotein si30 


Barbashov, SJP. et al Bioorg. Khim. (1991) 17:421- 
423. 



WO 02/50289 



PCT/US01/50240 



SEQUENCE LISTING 



<110> SemBioSys Genetics, Inc. 

Syngenta Participations AG 

<120> METHODS FOR THE PRODUCTION OF MULTIMERIC PROTEINS, AND RELATED 
COMPOSITIONS 

<130> 38814-351PC 

<140> Not Yet Assigned 
<141> Herewith 

<160> 313 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 2 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 2 

gaaagcttaa gccaagtgtt tg 22 

<210> 3 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 4 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 4 

ttaagcttca atcactctta ccttgctg 28 

<210> 5 
<211> 72 
<212> DNA 

<213> Artificial Sequence 



<400> 1 

taccatggct tcggaagaag ga 



22 



<400> 3 

ggccagcaca ctaccatgaa tggtctcgaa actcac 



36 



<220> 
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<223> Primer 
<400> 5 

actggagatg ttgactcgac ggatactacg gattggtcga cggctatgga agaaggacaa 60 
gtgatcgcct gc 72 

<210> 6 
<211> 80 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 6 

atccgtcgag tcaacatctc cagtttcctc ggtggtctcg ttagccttcg atccagcaat 60 
ctcttgtaag aatgctctgc 80 

<210> 7 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 7 

gtggaagctt atggagatgg ag 22 

<210> 8 
<211> 1002 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Chimeric 
<400> 8 

atgaatggtc tcgaaactca 
cacacggcgg cgatttacgc 
atggctaacg acatcgctcc 
cccggatttc cagaaggtat 
gagcgattcg gtactacgat 
ccgtttaagc tattcacaga 
ggagctgtgg ctaagcggct 
aaccgtggaa tctccgcttg 
cctcttgcgg tgatcggtgg 
tatggatcta aagtgtatat 
cagcagcgag ctttgtctaa 
gcttatggag atggagaaag 
ggagatgttt ctgatttaaa 
accaagtttt tggatggtgg 
ggtactacac agactagcgt 
tataggcaag ccatcactgc 
tacttacaag agattggatc 

<210> 9 
<211> 999 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 9 

atgaatggtc tcgaaactca caacacaagg ctctgtatcg 
cacacggcgg cgatttacgc agctagggct gaacttaaac 
atggctaacg acatcgctcc cggtggtcaa ctcaaccaac 
ggatttccag aaggtattct cggagtagag ctcactgaca 
cgattcggta ctacgatatt tacagagacg gtgacgaaag 



caacacaagg ctctgtatcg 
agctagggct gaacttaaac 
cggtggtcaa ctaacaacca 
tctcggagta gagctcactg 
atttacagag acggtgacga 
ttcaaaagcc attctcgctg 
tagcttcgtt ggatctggtg 
tgctgtttgc gacggagctg 
aggcgattca gcaatggaag 
aatccatagg agagatgctt 
tcctaagatt gatgtgattt 
agatgtgctt ggaggattga 
agtttctgga ttgttctttg 
tgttgagtta gattcggatg 
tcccggagtt ttcgctgcgg 
tgcaggaact gggtgcatgg 
tcagcaaggt aagagtgatt 



taggaagtgg cccagcggca 60 
ctcttctctt cgaaggatgg 120 
ccaccgacgt cgagaatttc 180 
acaaattccg taaacaatcg 240 
aagt'cgattt ctcttcgaaa 300 
acgctgtgat tctcgctact 360 
aaggttctgg aggtttctgg 420 
ctccgatatt ccgtaacaaa 480 
aagcaaactt tcttacaaaa 540 
ttagagcgtc taagattatg 600 
ggaactcgtc tgttgtggaa 660 
aagtgaagaa tgtggttacc 720 
ctattggtca tgagccagct 780 
gttatgttgt cacgaagcct 840 
gtgatgttca ggataagaag 900 
cagctttgga tgcagagcat 960 
ga 1002 



taggaagtgg cccagcggca 60 
ctcttctctt cgaaggatgg 120 
caccgcgtga gaatttcccc 180 
aattccgtaa acaatcggag 240 
tcgatttctc ttcgaaaccg 300 
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tttaagctat tcacagattc aaaagccatt ctcgctgacg ctgtgattct cgctatcgga 360 
gctgtggcta agtggcttag cttcgttgga tctggtgaag ttctcggagg tttgtggaac 420 
cgtggaatct ccgcttgtgc tgtttgcgac ggagctgctc cgatattccg caacaaacct 480 
cttgcggtga tcggtggagg cgattctgca atggaagaag caaactttct tacaaaatat 540 
ggatctaaag tgtatataat cgataggaga gatgctttta gagcgtctaa gattatgcag 600 
cagcgagctt tgtctaatcc taagattgat gtgatttgga actcgtctgt tgtggaagct 660 
tatggagatg gagaaagaga tgtgcttgga ggattgaaag tgaagaatgt ggttaccgga 72 0 
gatgtttctg atttaaaagt ttctggattg ttctttgcta ttggtcatga gccagctacc 780 
aagtttttgg atggtggtgt tgagttagat tcggatggtt atgttgtcac gaagcctggt 840 
actacacaga ctagcgttcc cggagttttc gctgcgggtg atgttcagga taagaagtat 900 
aggcaagcca tcactgctgc aggaactggg tgcatggcag ctttggatgc agagcattac 960 
ttacaagaga ttggatctca gcaaggtaag agtgattga 999 

<210> 10 
<211> 1002 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Chimeric 

<221> CDS 

<222> (1) . . . (1002) 

<223> cDNA encoding NADPH thioredoxin reductase 
<400> 10 

atg aat ggt etc gaa act cac aac aca agg etc tgt ate gta gga agt 48 
Met Asn Gly Leu Glu Thr His Asn Thr Arg Leu Cys lie Val Gly Ser 
1 5 10 15 

ggc cca gcg gca cac acg gcg gcg att tac gca get agg get gaa ctt 96 
Gly Pro Ala Ala His Thr Ala Ala lie Tyr Ala Ala Arg Ala Glu Leu 
20 25 30 

aaa cct ctt etc ttc gaa gga tgg atg get aac gac ate get ccc ggt 144 
Lys Pro Leu Leu Phe Glu Gly Trp Met Ala Asn Asp lie Ala Pro Gly 
35 40 45 

ggt caa eta aca ace acc acc gac gtc gag aat ttc ccc gga ttt cca 192 
Gly Gin Leu Thr Thr Thr Thr Asp Val Glu Asn Phe Pro Gly Phe Pro 
50 55 60 

gaa ggt att etc gga gta gag etc act gac aaa ttc cgt aaa caa teg 240 
Glu Gly lie Leu Gly Val Glu Leu Thr Asp Lys Phe Arg Lys Gin Ser 
65 70 75 ~ 80 

gag cga ttc ggt act acg ata ttt aca gag acg gtg acg aaa gtc gat 288 
Glu Arg Phe Gly Thr Thr lie Phe Thr Glu Thr Val Thr Lys Val Asp 
85 90 95 

ttc tct teg aaa ccg ttt aag eta ttc aca gat tea aaa gee att etc 336 
Phe Ser Ser Lys Pro Phe Lys Leu Phe Thr Asp Ser Lys Ala lie Leu 
100 105 110 

get gac get gtg att etc get act gga get gtg get aag egg ctt age 384 
Ala Asp Ala Val lie Leu Ala Thr Gly Ala Val Ala Lys Arg Leu Ser 
115 120 125 

ttc gtt gga tct ggt gaa ggt tct gga ggt ttc tgg aac cgt gga ate 432 
Phe Val Gly Ser Gly Glu Gly Ser Gly Gly Phe Trp Asn Arg Gly He 
130 135 140 

tec get tgt get gtt tgc gac gga get get ccg ata ttc cgt aac aaa 480 
Ser Ala Cys Ala Val Cys Asp Gly Ala Ala Pro He Phe Arg Asn Lys 
145 150 155 160 

cct ctt gcg gtg ate ggt gga ggc gat tea gca atg gaa gaa gca aac 528 
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Pro Leu Ala Val He Gly Gly Gly Asp Ser Ala Met Glu Glu Ala Asn 
165 170 175 

ttt ctt aca aaa tat gga tct aaa gtg tat ata ate cat agg aga gat 576 
Phe Leu Thr Lys Tyr Gly Ser Lys Val Tyr lie lie His Arg Arg Asp 
180 185 190 

get ttt aga gcg tct aag att atg cag cag cga get ttg tct aat cct 624 
Ala Phe Arg Ala Ser Lys lie Met Gin Gin Arg Ala Leu Ser Asn Pro 
195 200 205 

aag att gat gtg att tgg aac teg tct gtt gtg gaa get tat gga gat 672 
Lys He Asp Val He Trp Asn Ser Ser Val Val Glu Ala Tyr Gly Asp 
210 215 220 

gga gaa aga gat gtg ctt gga gga ttg aaa gtg aag aat gtg gtt ace 720 
Gly Glu Arg Asp Val Leu Gly Gly Leu Lys Val Lys Asn Val Val Thr 
225 230 " 235 240 

gga gat gtt tct gat tta aaa gtt tct gga ttg ttc ttt get att ggt 768 ' 
Gly Asp Val Ser Asp Leu Lys Val Ser Gly Leu Phe Phe Ala He Gly 
245 250 255 

cat gag cca get ace aag ttt ttg gat ggt ggt gtt gag tta gat teg 816 
His Glu Pro Ala Thr Lys Phe Leu Asp Gly Gly Val Glu Leu Asp Ser 
260 265 270 

gat ggt tat gtt gtc acg aag cct ggt act aca cag act age gtt ccc 864 
Asp Gly Tyr Val Val Thr Lys Pro Gly Thr Thr Gin Thr Ser Val Pro 
275 280 285 

gga gtt tte get gcg ggt gat gtt cag gat aag aag tat agg caa gee 912 
Gly Val Phe Ala Ala Gly Asp Val Gin Asp Lys Lys Tyr Arg Gin Ala 
290 295 300 

ate act get gca gga act ggg tgc atg gca get ttg gat gca gag cat 960 
He Thr Ala Ala Gly Thr Gly Cys Met Ala Ala Leu Asp Ala Glu His 
305 310 315 320 

tac tta caa gag att gga tct cag caa ggt aag agt gat tga 1002 
Tyr Leu Gin Glu He Gly Ser Gin Gin Gly Lys Ser Asp * 
325 330 



<210> 11 
<211> 333 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric 
<400> 11 

Met Asn Gly Leu Glu Thr His Asn Thr Arg Leu Cys He Val Gly Ser 

1 5 10 15 

Gly Pro Ala Ala His Thr Ala Ala He Tyr Ala Ala Arg Ala' Glu Leu 

20 25 30 

Lys Pro Leu Leu Phe Glu Gly Trp Met Ala Asn Asp He Ala Pro Gly 

35 40 45 

Gly Gin Leu Thr Thr Thr Thr Asp Val Glu Asn Phe Pro Gly Phe Pro 

50 55 60 

Glu Gly He Leu Gly Val Glu Leu Thr Asp Lys Phe Arg Lys Gin Ser 
65 70 75 80 

Glu Arg Phe Gly Thr Thr He Phe Thr Glu Thr Val Thr Lys Val Asp 

85 90 95 

Phe Ser Ser Lys Pro Phe Lys Leu Phe Thr Asp Ser Lys Ala He Leu 
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100 



7V *1 -j 

Axa 


Asp 


ax a 


VaX xxe 


Leu 


ax a Tnr 






TIC 






1 O ft 

XzU 




Val 


Gly 


Ser Gly 


GXU 


Gly Ser 












Xob 


Ser 


AX a 


Cys 


Ala val 


cys 


Asp Gly 










XbU 




Pro 


Leu 


Ala 


Val He 


Gly 


Gly Gly 








165 






Phe 


Leu 


Thr 


Lys Tyr 


Gly 


Ser Lys 








XoO 






TV 7 — 

Ala 


Pne 


Arg 


Ala Ser 


Lys 


He Met 






195 






200 


Lys 


ixe 


TV mw* 

ASp 


val xxe 


Trp 


Asn Ser 




zxu 










GXy 


GXU 


Arg 


Asp Val 


Leu 


Gly Gly 










O O ft 

£ j U 




Gly 


Asp 


val 


Ser Asp 


Leu 


Lys Val 








245 






His 


Glu 


Pro 


Ala Thr 


Lys 


Phe Leu 








260 




Asp 


Gly 


Tyr 


Val Val 


Thr 


Lys Pro 






275 






280 


Gly 


Val 


Phe 


Ala Ala 


Gly 


Asp Val 




290 








295 


He 


Thr 


Ala 


Ala Gly 


Thr 


Gly Cys 


305 








310 




Tyr 


Leu 


Gin 


Glu He 


Gly 


Ser Gin 








325 





105 110 
Gly Ala Val Ala Lys Arg Leu Ser 
125 

Gly Gly Phe Trp Asn Arg Gly He 
140 

Ala Ala Pro He Phe Arg Asn Lys 
155 160 
Asp Ser Ala Met Glu Glu Ala Asn 

170 175 
Val Tyr He He His Arg Arg Asp 
185 190 
Gin Gin Arg Ala Leu Ser Asn Pro 
205 

Ser Val Val Glu Ala Tyr Gly Asp 
220 

Leu Lys Val Lys Asn Val Val Thr 
235 240 
Ser Gly Leu Phe Phe Ala He Gly 

250 255 
Asp Gly Gly Val Glu Leu Asp Ser 
265 270 
Gly Thr Thr Gin Thr Ser Val Pro 
285 

Gin Asp Lys Lys Tyr Arg Gin Ala 
300 

Met Ala Ala Leu Asp Ala Glu His 
315 ' 320 

Gin Gly Lys Ser Asp 
330 



<210> 12 
<211> 332 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 12 

Met Asn Gly Leu Glu Thr His Asn Thr Arg Leu Cys He Val Gly Ser 

1 5 10 * 15 

Gly Pro Ala Ala His Thr Ala Ala He Tyr Ala Ala Arg Ala Glu Leu 

20 25 30 

Lys Pro Leu Leu Phe Glu Gly Trp Met Ala Asn Asp He Ala Pro Gly 

35 40 45 

Gly Gin Leu Asn Gin Pro Pro Arg Glu Asn Phe Pro Gly Phe Pro Glu 

50 55 60 

Gly He Leu Gly Val Glu Leu Thr Asp Lys Phe Arg Lys Gin Ser Glu 
65 '70 75 80 

Arg Phe Gly Thr Thr He Phe Thr Glu Thr Val Thr Lys Val Asp Phe 

85 90 95 

Ser Ser Lys Pro Phe Lys Leu Phe Thr Asp Ser Lys Ala He Leu Ala 

100 105 ~ 110 

Asp Ala Val He Leu Ala He Gly Ala Val Ala Lys Trp Leu Ser Phe 

115 120 125 

Val Gly Ser Gly Glu Val Leu Gly Gly Leu Trp Asn Arg Gly He Ser 

130 135 140 

Ala Cys Ala Val Cys Asp Gly Ala Ala Pro He Phe Arg Asn, Lys Pro 
145 150 155 160 

Leu Ala Val He Gly Gly Gly Asp Ser Ala Met Glu Glu Ala Asn Phe 

165 170 175 

Leu Thr Lys Tyr Gly Ser Lys Val Tyr He He Asp Arg Arg Asp Ala 

180 185 190 

Phe Arg Ala Ser Lys He Met Gin Gin Arg Ala Leu Ser Asn Pro Lys 

195 200 205 

He Asp Val He Trp Asn Ser Ser Val Val Glu Ala Tyr Gly Asp Gly 

210 215 220 

Glu Arg Asp Val Leu Gly Gly Leu Lys Val Lys Asn Val Val Thr Gly 
225 230 235 240 
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Asp Val 


Ser 


nop ucu 


Lys 


Val Ser 


Gly Leu 


Phe 


Phe Ala lie Glv His 






94 ^ 

A *t D 






250 




255 


Glu Pro 


Ala 


TV-it* TtVc» 
± xix ±-iy z> 


Phe 


JiPH A Q"TJ 

J—* \ — U AOk^ 


Gly Gly 


Val 


flln Tif=**n Astd Ser Ast) 






Z b U 






9 K 






Gly Tyr Val 




T iVC 

XI jr O 


Pro Gly 


Thr Thr 


Gin 


Tlrr S^t Val Prn Glv 




275 






9fl0 

A O V 






285 


Val Phe 


Ala 


Ala Gly 


Asp 


Val Gin 


Asp Lys 


Lys 


Tyr Arg Gin Ala lie 


290 








295 






300 


Thr Ala 


Ala 


Gly Thr 


Gly 


Cys Met 


Ala Ala 


Leu 


Asp Ala Glu His Tyr 


305 






310 






315 


320 


Leu Gin 


Glu 


lie Gly 


Ser 


Gin Gin 


Gly Lys 


Ser 


Asp 






325 






330 







<210> 13 
<211> 333 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric 



<400> 13 



Met. Asn 


Gly 


Leu 


Glu 


Thr 


His Asn 


Thr Arg 


Leu 


Cys He Val Gly Ser 


l 






b 






1U 




1 c 
lb 


Gly Pro 


Ala 


Ala 


His 


Tnr 


Ala Ala 


lie lyr 


Ala 


Ala Arg Ala Glu Leu 






O ft 








9 C 






Lys Pro 


Leu 


Leu 


Pne 


Glu 


Gly Trp 


Met Ala 


TV <-» vt 

Asn 


Asp He Ala Pro Gly 




Jb 








A ft 






A C 

4b 


Gly Gin 


Leu 


Thr 


Thr 


Thr 


Thr Asp 


Val Glu 


Asn 


Phe Pro Gly Phe Pro 


















ovJ 


Glu Gly 


lie 


Leu 


Gly 


Val 


Glu Leu 


Thr Asp 


Lys 


Phe Arg Lys Gin Ser 


65 








70 






7b 


O A 

80 


Glu Arg 


Pne 


Gly 


Thr 


Thr 


lie Pne 


Thr Glu 


Thr 


Val Tnr Lys Val Asp 












90 




95 


Phe Ser 


Ser 


Lys 


Pro 


Phe 


Lys Leu 


Phe Thr 


Asp 


Ser Lys Ala He Leu 






100 








105 




110 


Ala Asp 


Ala 


Val 


He 


Leu 


Ala Thr 


Gly Ala 


Val 


Ala Lys Arg Leu Ser 




115 








120 






125 


Phe Val 


Gly 


Ser 


Gly 


Glu 


Gly Ser 


Gly Gly 


Phe 


Trp Asn Arg Gly He 


130 










135 






140 


Ser Ala 


Cys 


Ala 


Val 


Cys 


Asp Gly 


Ala Ala 


Pro 


lie Phe Arg Asn Lys 


145 








150 






155 


160 


Pro Leu 


Ala 


Val 


He 


Gly 


Gly Gly 


Asp Ser 


Ala 


Met Glu Glu Ala Asn 








165 






170 




175 


Phe Leu 


Thr 


Lys 


Tyr 


Gly 


Ser Lys 


Val Tyr 


lie 


He His Arg Arg Asp 






180 








185 




190 


Ala Phe 


Arg 


Ala 


Ser 


Lys 


He Met 


Gin Gin 


Arg 


Ala Leu Ser Asn Pro 




195 








200 






205 


Lys lie 


Asp 


Val 


He 


Trp 


Asn Ser 


Ser Val 


Val 


Glu Ala Tyr Gly Asp 


210 










215 






220 


Gly Glu 


Arg 


Asp 


Val 


Leu 


Gly Gly 


Leu Lys 


Val 


Lys Asn Val Val Thr 


225 








230 






235 


240 


Gly Asp 


Val 


Ser 


Asp 


Leu 


Lys Val 


Ser Gly 


Leu 


Phe Phe Ala He Gly 








245 






250 




255 


His Glu 


Pro 


Ala 


Thr 


Lys 


Phe Leu 


Asp Gly 


Gly 


Val Glu Leu Asp Ser 






260 








265 




270' 


Asp Gly 


Tyr 


Val 


Val 


Thr 


Lys Pro 


Gly Thr 


Thr 


Gin Thr Ser Val Pro 




275 








280 






285 


Gly Val 


Phe 


Ala 


Ala 


Gly 


Asp Val 


Gin Asp 


Lys 


Lys Tyr Arg Gin Ala 


290 










295 






300 


lie Thr 


Ala 


Ala 


Gly 


Thr 


Gly Cys 


Met Ala 


Ala 


Leu Asp Ala Glu His 


305 






310 






315 


320 


Tyr Leu 


Gin 


Glu 


He 


Gly 


Ser Gin 


Gin Gly 


Lys 


Ser Asp 



325 330 
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<210> 14 
<211> 3129 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

<222> (1555) ... (1899) 
<223> Chimeric 
<400> 14 

ctgcaggaat tcattgtact cccagtatca ttatagtgaa agttttggct ctctcgccgg 60 
tggtttttta cctctattta aaggggtttt ccacctaaaa attctggtat cattctcact 120 
ttacttgtta ctttaatttc tcataatctt tggttgaaat tatcacgctt ccgcacacga 180 
tatccctaca aatttattat ttgttaaaca ttttcaaacc gcataaaatt ttatgaagtc 240 
ccgtctatct ttaatgtagt ctaacatttt catattgaaa tatataattt acttaatttt 3 00 
agcgttggta gaaagcataa tgatttattc ttattcttct tcatataaat gtttaatata 360 
caatataaac aaattcttta ccttaagaag gatttcccat tttatatttt aaaaatatat 420 
ttatcaaata tttttcaacc acgtaaatct cataataata agttgtttca aaagtaataa 480 
aatttaactc cataattttt ttattcgact gatcttaaag caacacccag tgacacaact 540 
agccattttt ttctttgaat aaaaaaatcc aattatcatt gtattttttt tatacaatga 600 
aaatttcacc aaacaatcat ttgtggtatt tctgaagcaa gtcatgttat gcaaaattct 660 
ataattccca tttgacacta cggaagtaac tgaagatctg cttttacatg cgagacacat 720 
cttctaaagt aattttaata atagttacta tattcaagat ttcatatatc aaatactcaa 780 
tattacttct aaaaaattaa ttagatataa ttaaaatatt acttttttaa ttttaagttt 840 
aattgttgaa tttgtgacta ttgatttatt attctactat gtttaaattg ttttatagat 900 
agtttaaagt aaatataagt aatgtagtag agtgttagag tgttacccta aaccataaac 960 
tataagattt atggtggact aattttcata tatttcttat tgcttttacc ttttcttggt 1020 
atgtaagtcc gtaactggaa ttactgtggg ttgccatggc actctgtggt cttttggttc 1080 
atgcatggat gcttgcgcaa gaaaaagaca aagaacaaag aaaaaagaca aaacagagag 1140 
acaaaacgca atcacacaac caactcaaat tagtcactgg ctgatcaaga tcgccgcgtc 1200 
catgtatgtc taaatgccat gcaaagcaac acgtgcttaa catgcacttt aaatggctca 1260 
cccatctcaa cccacacaca aacacattgc ctttttcttc atcatcacca caaccacctg 1320 
tatatattca ttctcttccg ccacctcaat ttcttcactt caacacacgt caacctgcat 1380 
atgcgtgtca tcccatgccc aaatctccat gcatgttcca accaccttct ctcttatata 1440 
atacctataa atacctctaa tatcactcac ttctttcatc atccatccat ccagagtact 1500 
actactctac tactataata ccccaaccca actcatattc aatactactc tact atg 1557 

Met 
1 

cac acc gtt gag aca tgg 1605 
His Thr Val Glu Thr Trp 
15 

aaa act ctt gtg gtg gtt 1653 
Lys Thr Leu Val Val Val 
30 

cgt ttc ate get cca ttc 1701 
Arg Phe lie Ala Pro Phe 
45 

gtg ctt ttc etc aag gtt 1749 
Val Leu Phe Leu Lys Val 
60 65 

gat tgg gcg at a cag gcg 1797 
Asp Trp Ala lie Gin Ala 
80 

aag att ttg gac aaa gtt 1845 
Lys He Leu Asp Lys Val 
95 

acc att gec aaa cac ttg i893 
Thr He Ala Lys His Leu 



get teg gaa gaa gga caa gtg ate gee tgc 
Ala Ser Glu Glu Gly Gin Val He Ala Cys 
5 10 

aac gag cag ctt cag aag get aat gaa tec 
Asn Glu Gin- Leu Gin Lys Ala Asn Glu Ser 
20 25 

gat ttc acg get tct tgg tgt gga cca tgt 
Asp Phe Thr Ala Ser Trp Cys Gly Pro Cys 
35 40 

ttt get gat ttg get aag aaa ctt cct aac 
Phe Ala Asp Leu Ala Lys Lys Leu Pro Asn 
50 55 

gat act gat gaa ttg aag teg gtg gca agt 
Asp Thr Asp Glu Leu Lys Ser Val Ala Ser 
70 75 

atg cca acc ttc atg ttt ttg aag gaa ggg 
Met Pro Thr Phe Met Phe Leu Lys Glu Gly 
85 90 

gtt gga gee aag aaa gat gag ctt cag tct 
Val Gly Ala Lys Lys Asp Glu Leu Gin Ser 
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100 105 110 

get taa gcttaataag tatgaactaa aatgcatgta ggtgtaagag ctcatggaga 1949 
Ala * 



gcatggaata ttgtatccga ccatgtaaca gtataataac tgagctccat ctcacttctt 2009 
ctatgaataa acaaaggatg ttatgatata ttaacactct atetatgeae cttattgttc 2069 
tatgataaat ttcctcttat tattataaat catctgaatc gtgaeggett atggaatgct 2129 
tcaaatagta caaaaacaaa tgtgtactat aagactttct aaacaattct aactttagca 2189 
ttgtgaacga gacataagtg ttaagaagac ataacaatta taatggaaga agtttgtctc 2249 
catttatata ttatatatta cccacttatg tattatatta ggatgttaag gagacataac 2309 
aattataaag agagaagttt gtatccattt atatattata tactacccat ttatatatta 2369 
tacttatcca cttatttaat gtctttataa ggtttgatcc atgatatttc taatatttta 2429 
gttgatatgt atatgaaagg gtactatttg aactctctta ctctgtataa aggttggatc 2489 
atccttaaag tgggtctatt taattttatt gcttcttaca gataaaaaaa aaattatgag 2549 
ttggtttgat aaaatattga aggatttaaa ataataataa ataataaata acatataata 2609 
tatgtatata aatttattat aatataacat ttatctataa aaaagtaaat attgtcataa 2669 
atctatacaa tegtttagee ttgctggacg actctcaatt atttaaacga gagtaaacat 2729 
atttgacttt ttggttattt aacaaattat tatttaacac tatatgaaat tttttttttt 2789 
tateggcaag gaaataaaat taaattagga gggacaatgg tgtgtcccaa tccttataca 2849 
accaacttcc acaggaaggt caggtegggg acaacaaaaa aacaggcaag ggaaattttt 2909 
taatttgggt tgtcttgttt getgeataat ttatgcagta aaacactaca cataaccctt 2969 
ttagcagtag agcaatggtt gaccgtgtgc ttagcttctt ttattttatt tttttatcag 3029 
caaagaataa ataaaataaa atgagacact tcagggatgt ttcaaccctt atacaaaacc 3089 
ccaaaaacaa gtttcctagc accctaccaa ctaaggtacc 3129 

<210> 15 
<211> 114 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric 



<400> 15 




Met Ala 


Ser 


Glu Glu Gly Gin Val 


1 




5 


Trp Asn 


Glu 


Gin Leu Gin Lys Ala 






20 


Val Asp 


Phe 


Thr Ala' Ser Trp Cys 




35 


40 


Phe Phe 


Ala 


Asp Leu Ala Lys Lys 


50 




55 


Val Asp 


Thr 


Asp Glu Leu Lys Ser 


65 




70 


Ala Met 


Pro 


Thr Phe Met Phe Leu 






85 


Val Val 


Gly 


Ala Lys Lys Asp Glu 






100 


Leu Ala 







He Ala Cys His Thr Val Glu Thr 

10 15 
Asn Glu Ser Lys Thr Leu Val Val 
25 30 
Gly Pro Cys Arg Phe He Ala Pro 
45 

Leu Pro Asn Val Leu Phe Leu Lys 
60 

Val Ala Ser Asp Trp Ala He Gin 

75 80 
Lys Glu Gly Lys He Leu Asp Lys 

90 95 
Leu Gin Ser Thr He Ala Lys His 
105 110 



<210> 16 
<211> 3888 
<212> DNA 

<213> Artifcial sequence 
<220> 

<223> Chimeric 
<221> CDS 

<222> (1555) . . . (1907) 
<221> CDS 

<222> (2148) . , . (2659) 
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<400> 16 

ctgcaggaat tcattgtact cccagtatca ttatagtgaa agttttggct ctctcgccgg 60 
tggtttttta cctctattta aaggggtttt ccacctaaaa attctggtat cattctcact 120 
ttacttgtta ctttaatttc tcataatctt tggttgaaat tatcacgctt ccgcacacga 180 
tatccctaca aatttattat ttgttaaaca ttttcaaacc gcataaaatt ttatgaagtc 240 
ccgtctatct ttaatgtagt ctaacatttt catattgaaa tatataattt acttaatttt 300 
agcgttggta gaaagcataa tgatttattc ttattcttct tcatataaat gtttaatata 360 
caatataaac aaattcttta ccttaagaag gatttcccat tttatatttt .aaaaatatat 420 
ttatcaaata tttttcaacc acgtaaatct cataataata agttgtttca aaagtaataa 480 
aatttaactc cataattttt ttattcgact gatcttaaag caacacccag tgacacaact 540 
agccattttt ttctttgaat aaaaaaatcc aattatcatt gtattttttt tatacaatga 600 
aaatttcacc aaacaatcat ttgtggtatt tctgaagcaa gtcatgttat gcaaaattct 660 
ataattccca tttgacacta cggaagtaac tgaagatctg cttttacatg cgagacacat 720 
cttctaaagt aattttaata atagttacta tattcaagat ttcatatatc aaatactcaa 780 
tattacttct aaaaaattaa ttagatataa ttaaaatatt acttttttaa ttttaagttt 840 
aattgttgaa tttgtgacta ttgatttatt attctactat gtttaaattg ttttatagat 900 
agtttaaagt aaatataagt aatgtagtag agtgttagag tgttacccta aaccataaac 960 
tataagattt atggtggact aattttcata tatttcttat tgcttttacc ttttcttggt 1020 
atgtaagtcc gtaactggaa ttactgtggg ttgccatggc actctgtggt cttttggttc 1080 
atgcatggat gcttgcgcaa gaaaaagaca aagaacaaag aaaaaagaca aaacagagag 1140 
acaaaacgca atcacacaac caactcaaat tagtcactgg ctgatcaaga tcgccgcgtc 1200 
catgtatgtc taaatgccat gcaaagcaac acgtgcttaa catgcacttt aaatggctca 1260 
cccatctcaa cccacacaca aacacattgc ctttttcttc atcatcacca caaccacctg 1320 
tatatattca ttctcttccg ccacctcaat ttcttcactt caacacacgt caacctgcat 1380 
atgcgtgtca tcccatgccc aaatctccat gcatgttcca accaccttct ctcttatata 1440 
atacctataa atacctctaa tatcactcac ttctttcatc atccatccat ccagagtact 1500 
actactctac tactataata ccccaaccca actcatattc aatactactc tact atg 1557 

Met 
1 

gcg gat aca get aga gga acc cat cac gat ate ate ggc aga gac cag 1605 
Ala Asp Thr Ala Arg Gly Thr His His Asp He He Gly Arg Asp Gin 
5 10 15 



tac ccg atg atg ggc cga gac cga gac cag tac cag atg tec gga cga 1653 
Tyr Pro Met Met Gly Arg Asp Arg Asp Gin Tyr Gin Met Ser Gly Arg 
20 25 30 

gga tct gac tac tec aag tct agg cag att get aaa get gca act get 1701 
Gly Ser Asp Tyr Ser Lys Ser Arg Gin He Ala Lys Ala Ala Thr Ala 
35 ~ 40 45 



gtc aca get ggt ggt tec etc ctt gtt etc tec age ctt acc ctt gtt 1749 
Val Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Ser Leu Thr Leu Val 
50 55 ■ 60 65 

gga act gtc ata get ttg act gtt gca aca cct ctg etc gtt ate ttc 1797 
Gly Thr Val He Ala Leu Thr Val Ala Thr Pro Leu Leu Val He Phe 
70 75 80 



age cca ate ctt gtc ccg get etc ate aca gtt gca etc etc ate acc 1845 
Ser Pro He Leu Val Pro Ala Leu He Thr Val Ala Leu Leu He Thr 
85 90 95 

ggt ttt ctt tec tct gga ggg ttt ggc att gee get ata acc gtt ttc 1893 
Gly Phe Leu Ser Ser Gly Gly Phe Gly He Ala Ala He Thr, Val Phe 
100 105 110 



tct tgg att tac aa gtaagcacac atttatcatc ttacttcata attttgtgca 1947 
Ser Trp He Tyr Lys 
115 



atatgtgcat gcatgtgttg agecagtage tttggatcaa tttttttggt cgaataacaa 2007 

atgtaacaat aagaaattgc aaattctagg gaacatttgg ttaactaaat acgaaatttg 2067 

acctagctag cttgaatgtg tctgtgtata tcatctatat aggtaaaatg cttggtatga 2127 

tacctattga ttgtgaatag g tac gca acg gga gag cac cca cag gga tea 2178 

Tyr Ala Thr Gly Glu His Pro Gin Gly Ser 



-9- 



WO 02/50289 



PCT/US01/50240 



120 125 

gac aag ttg gac agt gca agg atg aag ttg gga age aaa get cag gat 2226 
Asp Lys Leu Asp Ser Ala Arg Met Lys Leu Gly Ser Lys Ala Gin Asp 
130 135 " 140 

ctg aaa gac aga get cag tac tac gga cag caa cat act ggt ggg gaa 2274 
Leu Lys Asp Arg Ala Gin Tyr Tyr Gly Gin Gin His Thr Gly Gly Glu 
145 150 - 155 160 

cat gac cgt gac cgt act cgt ggt ggc cag cac act acc atg get teg 2322 
His Asp Arg Asp Arg Thr Arg Gly Gly Gin His Thr Thr Met Ala Ser 
165 170 175 

gaa gaa gga caa gtg ate gee tgc cac acc gtt gag aca tgg aac gag 2370 
Glu Glu Gly Gin Val lie Ala Cys His Thr Val Glu Thr Trp Asn Glu 
180 185 190 

cag ctt cag aag get aat gaa tec aaa act ctt gtg gtg gtt gat ttc 2418 
Gin Leu Gin Lys Ala Asn Glu Ser Lys Thr Leu Val Val Val Asp Phe 
195 200 205 

acg get tct tgg tgt gga cca tgt cgt ttc ate get cca ttc ttt get 2466 
Thr Ala Ser Trp Cys Gly Pro Cys Arg Phe He Ala Pro Phe Phe Ala 
210 215 220 

gat ttg get aag aaa ctt cct aac gtg ctt ttc etc aag gtt gat act 2514 
Asp Leu Ala Lys Lys Leu Pro Asn Val Leu Phe Leu Lys Val Asp Thr 
225 230 235 " 240 

gat gaa ttg aag teg gtg gca agt gat tgg gcg at a cag gcg atg cca 2562 
Asp Glu Leu Lys Ser Val Ala Ser Asp Trp Ala He Gin Ala Met Pro 
245 " 250 255 

acc ttc atg ttt ttg aag gaa ggg aag att ttg gac aaa gtt gtt gga 2610 
Thr Phe Met Phe Leu Lys Glu Gly Lys He Leu Asp Lys Val Val Gly 
260 265 270 

gee aag aaa gat gag ctt cag tct acc att gee aaa cac ttg get taa 2658 
Ala Lys Lys Asp Glu Leu Gin Ser Thr He Ala Lys His Leu Ala * 
275 280 ~ 285 

g cttaataagt atgaactaaa atgcatgtag gtgtaagagc tcatggagag 2709 
catggaatat tgtatccgac catgtaacag tataataact gagctccatc tcacttcttc 2769 
tatgaataaa caaaggatgt tatgatatat taacactcta tctatgcacc ttattgttct 2829 
atgataaatt tcctcttatt attataaatc atctgaatcg tgaeggctta tggaatgett 2889 
caaatagtac aaaaacaaat gtgt aetata agactttcta aacaattcta actttagcat 2949 
tgtgaacgag acataagtgt taagaagaca taacaattat aatggaagaa gtttgtctcc 3009 
atttatatat tatatattac ccacttatgt attatattag gatgttaagg agacataaca 3069 
attataaaga gagaagtttg tatccattta tatattatat actacccatt tatatattat 3129 
acttatccac ttatttaatg tctttataag gtttgatcca tgatatttct aatattttag 3189 
ttgatatgta tatgaaaggg tactatttga actctcttac tctgtataaa ggttggatca 3249 
tccttaaagt gggtctattt aattttattg cttcttacag ataaaaaaaa aattatgagt 3309 
tggtttgata aaatattgaa ggatttaaaa taataataaa taataaataa catataatat 3369 
atgtatataa atttattata atataacatt tatctataaa aaagtaaata ttgtcataaa 3429 
tctatacaat cgtttagcct tgctggacga ctctcaatta tttaaacgag agtaaacata 3489 
tttgactttt tggttattta acaaattatt atttaacact atatgaaatt tttttttttt 3549 
ateggcaagg aaataaaatt aaattaggag ggacaatggt gtgtcccaat ccttatacaa 3609 
ccaacttcca caggaaggtc aggtegggga caacaaaaaa acaggcaagg gaaatttttt 3669 
aatttgggtt gtcttgtttg ctgeataatt tatgeagtaa aacactacac ataacccttt 3729 
tagcagtaga gcaatggttg accgtgtgct tagcttcttt tattttattt ttttatcagc 3789 
aaagaataaa taaaataaaa tgagacactt cagggatgtt tcaaccctta tacaaaaccc 3849 
caaaaacaag tttcctagca ccctaccaac taaggtacc 3888 

<210> 17 
<211> 118 
<212> PRT 
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WO 02/50289 
<213> Artifcial sequence 



<400> 17 



neu 

■L 


/VJ-3. 


Asp 


Tnr Ala 


Arg Gly Thr His 




Tyr 


PrO 


5 

Mec Met 


Gly Arg Asp Arg 












Arg 




ser 


Asp Tyr 


Ser Lys Ser Arg 






-J 3 




a. n 


Ala 


vaj. 


Thr 


Ala c^xy 


Gly Ser Leu Leu 




50 




55 


Val 


Gly 


Thr 


Val lie 


Ala Leu Thr Val 


65 








70 


Phe 


Ser 


Pro 


He Leu 


Val Pro Ala Leu 








85 




Thr 


Gly 


Phe 


Leu Ser 


Ser Gly Gly Phe 








100 


105 


Phe 


Ser 


Trp 


He Tyr 


Lys 






115 







<210> 18 
<211> 169 
<212> PRT 

<213> Artifcial sequence 
<400> 18 



Tyr Ala 
1 


Thr 


Gly Glu 
5 


His 


Pro Gin Gly 


Arg Met 


Lys 


Leu Gly 


Ser 


Lys Ala Gin 






20 




25 


Tyr Tyr 


Gly 


Gin Gin 


His 


Thr Gly Gly 




35 






40 


Arg Gly 


Gly 


Gin His 


Thr 


Thr Met Ala 


50 








55 


Ala Cys 


His 


Thr Val 


Glu 


Thr Trp Asn 


65 






70 


Glu Ser 


Lys 


Thr Leu 


Val 


Val Val Asp 






85 




Pro Cys 


Arg 


Phe He 


Ala 


Pro Phe Phe 






100 




105 


Pro Asn 


Val 


Leu Phe 


Leu' 


Lys Val Asp 




115 






120 


Ala Ser 


Asp 


Trp Ala 


He 


Gin Ala Met 


130 








135 


Glu Gly 


Lys 


He Leu 


Asp 


Lys Val Val 


145 






150 




Gin Ser 


Thr 


He Ala 


Lys 


His Leu Ala 






165 





PCT/US01/50240 



His Asp He He Gly Arg Asp 
10 15 
Asp Gin Tyr Gin Met Ser Gly 
30 

Gin He Ala Lys Ala Ala Thr 
45 

Val Leu Ser Ser Leu Thr Leu 
60 

Ala Thr Pro Leu Leu Val He 

75 80 
He Thr Val Ala Leu Leu lie 
90 95 
Gly He Ala Ala He Thr Val 
110 



Ser Asp Lys Leu Asp Ser Ala 
10 15 
Asp Leu Lys Asp Arg Ala Gin 
30 

Glu His Asp Arg Asp Arg Thr 
45 

Ser Glu Glu Gly Gin Val He 
60 

Glu Gin Leu Gin Lys Ala Asn 

75 * 80 

Phe Thr Ala Ser Trp Cys Gly 
90 95 
Ala Asp Leu Ala Lys Lys Leu 
110 

Thr Asp Glu Leu Lys Ser Val 

125 

Pro Thr Phe Met Phe Leu Lys 
140 

Gly Ala Lys Lys Asp Glu Leu 
155 160 



<210> 19 
<211> 3888 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Chimeric 
<221> CDS 

<222> (1555) . . . (2249) 
<221> CDS 

<222> (2490) . . . (2658) 
<400> 19 , 

ctgcaggaat tcattgtact cccagtatca ttatagtgaa agttttggct ctctcgccgg 60 
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tggtttttta cctctattta aaggggtttt ccacctaaaa attctggtat cattctcact 120 
ttacttgtta ctttaatttc tcataatctt tggttgaaat tatcacgctt ccgcacacga 180 
tatccctaca aatttattat ttgttaaaca ttttcaaacc gcataaaatt ttatgaagtc 240 
ccgtctatct ttaatgtagt ctaacatttt catattgaaa tatataattt acttaatttt 300 
agcgttggta gaaagcataa tgatttattc ttattcttct tcatataaat gtttaatata 360 
caatataaac aaattcttta ccttaagaag gatttcccat tttatatttt aaaaatatat 420 
ttatcaaata tttttcaacc acgtaaatct cataataata agttgtttca aaagtaataa 480 
aatttaactc cataattttt ttattcgact gatcttaaag caacacccag tgacacaact 540 
agccattttt ttctttgaat aaaaaaatcc aattatcatt gtattttttt tatacaatga 600 
aaatttcacc aaacaatcat ttgtggtatt tctgaagcaa gtcatgttat gcaaaattct 660 
ataattccca tttgacacta cggaagtaac tgaagatctg cttttacatg cgagacacat 720 
cttctaaagt aattttaata atagttacta tattcaagat ttcatatatc aaatactcaa 780 
tattacttct aaaaaattaa ttagatataa ttaaaatatt acttttttaa ttttaagttt 840 
aattgttgaa tttgtgacta ttgatttatt attctactat gtttaaattg ttttatagat 900 
agtttaaagt aaatataagt aatgtagtag agtgttagag tgttacccta aaccataaac 960 
tataagattt atggtggact aattttcata tatttcttat tgcttttacc ttttcttggt 1020 
atgtaagtcc gtaactggaa ttactgtggg ttgccatggc actctgtggt ctttfcggttc 1080 
atgcatggat gcttgcgcaa gaaaaagaca aagaacaaag aaaaaagaca aaacagagag 1140 
acaaaacgca atcacacaac caactcaaat tagtcactgg ctgatcaaga tcgccgcgtc 1200 
catgtatgtc taaatgccat gcaaagcaac acgtgcttaa catgcacttt aaatggctca 1260 
cccatctcaa cccacacaca aacacattgc ctttttcttc atcat caeca caaccacctg 1320 
tatatattca ttctcttccg ccacctcaat ttcttcactt caacacacgt caacctgcat 1380 
atgcgtgtca tcccatgccc aaatctccat gcatgttcca accaccttct ctcttatata 1440 
atacctataa atacctctaa tatcactcac ttctttcatc atccatccat ccagagtact 1500 
actactctac tactataata ccccaaccca actcatattc aatactactc tact atg 1557 

Met 
1 

get teg gaa gaa gga caa gtg ate gee tgc cac ace gtt gag aca tgg 1605 
Ala Ser Glu Glu Gly Gin Val lie Ala Cys His Thr Val Glu Thr Trp 
5 10 15 

aac gag cag ctt cag aag get aat gaa tec aaa act ctt gtg gtg gtt 1653 
Asn Glu Gin Leu Gin Lya Ala Asn Glu Ser Lys Thr Leu Val Val Val 
20 25 ^ 30 

gat ttc acg get tct tgg tgt gga cca tgt cgt ttc ate get cca ttc 1701 
Asp Phe Thr Ala Ser Trp Cys Gly Pro Cys Arg Phe lie Ala Pro Phe 
35 40 " 45 

ttt get gat ttg get aag aaa ctt cct aac gtg ctt ttc etc aag gtt 1749 
Phe Ala Asp Leu Ala Lys Lys Leu Pro Asn Val Leu Phe Leu Lys Val 
50 55 60 65 

gat act gat gaa ttg aag teg gtg gca agt gat tgg gcg ata cag gcg 1797 
Asp Thr Asp Glu Leu Lys Ser Val Ala Ser Asp Trp Ala lie Gin Ala 
70 75 ~ 80 

atg cca ace ttc atg ttt ttg aag gaa ggg aag att ttg gac aaa gtt 1845 
Met Pro Thr Phe Met Phe Leu Lys Glu Gly Lys lie Leu Asp Lys Val 
85 90 "* 95 

gtt gga gee aag aaa gat gag ctt cag tct ace att gec aaa cac ttg 1893 
Val Gly Ala Lys Lys Asp Glu Leu Gin Ser Thr lie Ala Lys His Leu 
100 105 110 

get atg gcg gat aca get aga gga ace cat cac gat ate ate ggc aga 1941 
Ala Met Ala Asp Thr Ala Arg Gly Thr His His Asp He He Gly Arg 
115 120 125 

gac cag tac ccg atg atg ggc cga gac cga gac cag tac cag atg tec 1989 
Asp Gin Tyr Pro Met Met Gly Arg Asp Arg Asp Gin Tyr Gin Met Ser 
130 135 140 145 

gga cga gga tct gac tac tec aag tct agg cag att get aaa get gca 3037 
Gly Arg Gly Ser Asp Tyr Ser Lys Ser Arg Gin He Ala Lys Ala Ala 
150 155 160 
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act get gtc aca get ggt ggt tec etc ctt gtt etc tec age ctt acc 2085 
Thr Ala Val Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Ser Leu Thr 
165 170 175 

ctt gtt gga act gtc ata get ttg act gtt gca aca cct ctg etc gtt 2133 
Leu Val Gly Thr Val lie Ala Leu Thr Val Ala Thr Pro Leu Leu Val 
180 185 190 

ate ttc age cca ate ctt gtc ccg get etc ate aca gtt gca etc etc 2181 
lie Phe Ser Pro He Leu Val Pro Ala Leu He Thr Val Ala Leu Leu 
195 200 205 

ate acc ggt ttt ctt tec tct gga ggg ttt ggc att gee get ata acc 2229 
He Thr Gly Phe Leu Ser Ser Gly Gly Phe Gly He Ala Ala He Thr 
210 215 220 225 

gtt ttc tct tgg att tac aa gtaagcacac atttatcatc ttacttcata 2279 
Val Phe Ser Trp He Tyr Lys 
230 

attttgtgca atatgtgc'at gcatgtgttg agecagtage tttggatcaa tttttttggt 2339 
cgaataacaa atgtaacaat aagaaattgc aaattctagg gaacatttgg ttaactaaat 2399 
acgaaatttg acctagctag cttgaatgtg tctgtgtata tcatctatat aggtaaaatg 2459 
cttggtatga tacctattga ttgtgaatag g tac gca acg gga gag cac cca 2511 

Tyr Ala Thr Gly Glu His Pro 
235 

cag gga tea gac aag ttg gac agt gca agg atg aag ttg gga age aaa 2559 
Gin Gly Ser Asp Lys Leu Asp Ser Ala Arg Met Lys Leu Gly Ser Lys 
240 " " 245 250 255 

get cag gat ctg aaa gac aga get cag tac tac gga cag caa cat act 2607 
Ala Gin Asp Leu Lys Asp Arg Ala Gin Tyr Tyr Gly Gin Gin His Thr 
260 * ~ 265 270 

ggt ggg gaa cat gac cgt gac cgt act cgt ggt ggc cag cac act act 2655 
Gly Gly Glu His Asp Arg Asp Arg Thr Arg Gly Gly Gin His Thr Thr 
275 280 285 

taa gcttaataag tatgaactaa aatgcatgta ggtgtaagag ctcatggaga 2708 
* 



gcatggaata ttgrtatccga ccatgtaaca 
ctatgaataa acaaaggatg ttatgatata 
tatgataaat ttcctcttat tattataaat 
tcaaatagta caaaaacaaa tgtgtactat 
ttgtgaacga gacataagtg ttaagaagac 
catttatata ttatatatta cccacttatg 
aattataaag agagaagttt gtatccattt 
tacttatcca cttatttaat gtctttataa 
gttgatatgt atatgaaagg gtactatttg 
atccttaaag tgggtctatt taattttatt 
ttggtttgat aaaatattga aggatttaaa 
tatgtatata aatttattat aatataacat 
atctatacaa tegtttagee ttgctggacg 
atttgacttt ttggttattt aacaaattat 
tateggcaag gaaataaaat taaattagga 
accaacttcc acaggaaggt caggtegggg 
taatttgggt tgtcttgttt getgeataat 
ttagcagtag agcaatggtt gaccgtgtgc 
caaagaataa ataaaataaa atgagacact 
ccaaaaacaa gtttcctagc accctaccaa 

<210> 20 
<211> 232 



gtataataac tgagctccat ctcacttctt 2768 
ttaacactct atetatgeae cttattgttc 2828 
catctgaatc gtgaeggett atggaatgct 2888 
aagactttct aaacaattct aactttagca 2948 
ataacaatta taatggaaga agtttgtctc 3008 
tattatatta ggatgttaag gagacataac 3068 
atatattata tactacccat ttatatatta 3128 
ggtttgatcc atgatatttc taatatttta 3188 
aactctctta ctctgtataa aggttggatc 3248 
gcttcttaca gataaaaaaa aaattatgag 3308 
ataataataa ataataaata acatataata 3368 
ttatctataa aaaagtaaat attgtcataa 3428 
actctcaatt atttaaacga gagtaaacat 3488 
tatttaacac tatatgaaat tttttttttt 3548 
gggacaatgg tgtgtcccaa tccttataca 3608 
acaacaaaaa aacaggcaag ggaaattttt 3668 
ttatgcagta aaacactaca cataaccctt 3728 
ttagcttctt ttattttatt tttttatcag 3788 
tcagggatgt ttcaaccctt atacaaaacc 3848 
ctaaggtacc 3888 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric 

<400> 20 



Met 


Ala 


Ser 


Glu Glu Gly 


Gin Val 


He Ala Cys His Thr 


Val Glu Thr 


1 






5 




10 


15 


Trp 


Asn 


Glu 


Gin Leu Gin 


Lys Ala 


Asn Glu Ser Lys Thr 


Leu Val Val 








20 


25 


30 


Val 


Asp 


Phe 


Thr Ala Ser 


Trp Cvs 


Gly Pro Cys Arg Phe 


He Ala Pro 






35 




40 


45 




Phe 


Phe 


Ala 


Asp Leu Ala 


Lvs Lvs 


Leu Pro Asn Val Leu 


Phe Leu Lys 




50 






55 


60 




Val 


Asp 


Thr 


Asp Glu Leu 


Lys Ser 


Val Ala Ser Asp Trp 


Ala He Gin 


65 






70 




75 


80 


Ala 


Met 


Pro 


Thr Phe Met 


Phe Leu 


Lys Glu Gly Lys He 


Leu Asp Lys 








85 




90 


95 


Val 


Val 


Gly 


Ala Lys Lys 


Asp Glu 


Leu Gin Ser Thr He 


Ala Lys His 








100 




105 


110 


Leu 


Ala 


Met 


Ala Asp Thr 


Ala Arg 


Gly Thr His His Asp 


He He Gly 






115 




120 


125 




Arcr 


Asp 


Gin 


Tyr Pro Met 


Met Gly 


Arg Asp Arg Asp Gin 


Tyr Gin Met 




130 






135 


140 




Ser 


Gly 


Arg 


Gly Ser Asp 


Tyr Ser 


Lys Ser Arg Gin He 


Ala Lys Ala 


145 






150 




155 


160 


Ala 


Thr 


Ala 


Val Thr Ala 


Gly Gly 


Ser Leu Leu Val Leu 


Ser Ser Leu 








165 


170 


175 


Thr 


Leu 


Val 


Gly Thr Val 


He Ala 


Leu Thr Val Ala Thr 


Pro Leu Leu 








180 




185 


190 


Val 


He 


Phe 


Ser Pro He 


Leu Val 


Pro Ala Leu He Thr 


Val Ala Leu 






195 




200 


205 




Leu 


He 


Thr 


Gly Phe Leu 


Ser Ser 


Gly Gly Phe Gly He 


Ala Ala He 




210 






215 


220 




Thr 


Val 


Phe 


Ser Trp He 


Tyr Lys 






225 






230 









<210> 21 
<211> 55 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric 
<400> 21 

Tyr Ala Thr Gly Glu His Pro Gin Gly Ser Asp Lys Leu Asp Ser Ala 

15 10 15 

Arg Met Lys Leu Gly Ser Lys Ala Gin Asp Leu Lys Asp Arg Ala Gin 

20 25 30 

Tyr Tyr Gly Gin Gin His Thr Gly Gly Glu His Asp Arg Asp Arg Thr 

35 40 ~ 45 

Arg Gly Gly Gin His Thr Thr 
50 55 



<210> 22 

<211> 3787 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Chimeric 

<221> CDS 
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<222> (1555) . . . (2556) 
<400> 22 

ctgcaggaat tcattgtact cccagtatca ttatagtgaa agttttggct ctctcgccgg 60 
tggtttttta cctctattta aaggggtttt ccacctaaaa attctggtat cattctcact 120 
ttacttgtta ctttaatttc tcataatctt tggttgaaat tatcacgctt ccgcacacga 180 
tatccctaca aatttattat ttgttaaaca ttttcaaacc gcataaaatt ttatgaagtc 240 
ccgtctatct ttaatgtagt ctaacatttt catattgaaa tatataattt acttaatttt 300 
agcgttggta gaaagcataa tgatttattc ttattcttct tcatataaat gtttaatata 360 
caatataaac aaattcttta ccttaagaag gatttcccat tttatatttt aaaaatatat 420 
ttatcaaata tttttcaacc acgtaaatct cataataata agttgtttca aaagtaataa 480 
aatttaactc cataattttt ttattcgact gatcttaaag caacacccag tgacacaact 540 
agccattttt ttctttgaat aaaaaaatcc aattatcatt gtattttttt tatacaatga 600 
aaatttcacc aaacaatcat ttgtggtatt tctgaagcaa gtcatgttat gcaaaattct 660 
ataattccca tttgacacta cggaagtaac tgaagatctg cttttacatg cgagacacat 720 
cttctaaagt aattttaata atagttacta tattcaagat ttcatatatc aaatactcaa 780 
tattacttct aaaaaattaa ttagatataa ttaaaatatt acttttttaa ttttaagttt 840 
aattgttgaa tttgtgacta ttgatttatt attctactat gtttaaattg ttttatagat 900 
agtttaaagt aaatataagt aatgtagtag agtgttagag tgttacccta aaccataaac 960 
tataagattt atggtggact aattttcata tatttcttat tgcttttacc ttttcttggt 1020 
atgtaagtcc gtaactggaa ttactgtggg ttgccatggc actctgtggt cttttggttc 1080 
atgcatggat gcttgcgcaa gaaaaagaca aagaacaaag aaaaaagaca aaacagagag 1140 
acaaaacgca atcacacaac caactcaaat tagtcactgg ctgatcaaga tcgccgcgtc 1200 
catgtatgtc taaatgccat gcaaagcaac acgtgcttaa catgcacttt aaatggctca 1260 
cccatctcaa cccacacaca aacacattgc ctttttcttc atcatcacca caaccacctg 1320 
tatatattca ttctcttccg ccacctcaat ttcttcactt caacacacgt caacctgcat 1380 
atgcgtgtca tcccatgccc aaatctccat gcatgttcca accaccttct ctcttatata 1440 
atacctataa atacctctaa tatcactcac ttctttcatc atccatccat ccagagtact 1500 
actactctac tactataata ccccaaccca actcatattc aatactactc tact atg 1557 

Met 
1 

aat ggt etc gaa act cac aac aca agg etc tgt ate gta gga agt ggc 1605 
Asn Gly Leu Glu Thr His Asn Thr Arg Leu Cys lie Val Gly Ser Gly 
5 10 15 

cca gcg gca cac acg gcg gcg att tac gca get agg get gaa ctt aaa 1653 
Pro Ala Ala His Thr Ala Ala He Tyr Ala Ala Arg Ala Glu Leu Lys 
20 25 30 

cct ctt etc ttc gaa gga tgg atg get aac gac ate get ccc ggt ggt 1701 
Pro Leu Leu Phe Glu Gly Trp Met Ala Asn Asp He Ala Pro Gly Gly 
35 40 45 

caa eta aca ace ace acc gac gtc gag aat ttc ccc gga ttt cca gaa 1749 
Gin Leu Thr Thr Thr Thr Asp Val Glu Asn Phe Pro Gly Phe Pro Glu 
50 55 60 65 

ggt att etc gga gta gag etc act gac aaa ttc cgt aaa caa teg gag 1797 
Gly He Leu Gly Val Glu Leu Thr Asp Lys Phe Arg Lys Gin Ser Glu 
70 75 80 

cga ttc ggt act acg at a ttt aca gag acg gtg acg aaa gtc gat ttc 1845 
Arg Phe Gly Thr Thr He Phe Thr Glu Thr Val Thr Lys Val Asp Phe 
85 90 95 

tct teg aaa ccg ttt aag eta ttc aca gat -tea aaa gee att etc get 1893 
Ser Ser Lys Pro Phe Lys Leu Phe Thr Asp Ser Lys Ala He Leu Ala 
100 105 110 

gac get gtg att etc get act gga get gtg get aag egg ctt age ttc 1941 
Asp Ala Val He Leu Ala Thr Gly Ala Val Ala Lys Arg Leu Ser Phe 
115 120 125 

gtt gga tct ggt gaa ggt tct gga ggt ttc tgg aac cgt gga ate tec 1989 
Val Gly Ser Gly Glu Gly Ser Gly Gly Phe Trp Asn Arg Gly He Ser 
130 135 140 145 
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get tgt get gtt tgc gac gga get get ccg ata ttc cgt aac aaa cct 2037 
Ala Cys Ala Val Cys Asp Gly Ala Ala Pro lie Phe Arg Asn Lys Pro 
150 ~ 155 160 



ctt gcg gtg ate ggt gga ggc gat tea gca atg gaa gaa gca aac ttt 
Leu Ala Val He Gly Gly Gly Asp Ser Ala Met Glu Glu Ala Asn Phe 
165 " 170 175 



2085 



ctt aca aaa tat gga tct aaa gtg tat ata ate cat agg aga gat get 
Leu Thr Lys Tyr Gly Ser Lys Val Tyr He He His Arg Arg Asp Ala 
180 185 190 



2133 



ttt aga gcg tct aag att atg cag cag cga get ttg tct aat cct aag 
Phe Arg Ala Ser Lys He Met Gin Gin Arg Ala Leu Ser Asn Pro Lys 
195 200 205 



2181 



att gat gtg att tgg aac teg tct gtt gtg gaa get tat gga gat gga 
He Asp Val He Trp Asn Ser Ser Val Val Glu Ala Tyr Gly Asp Gly 
210 215 220 225 



2229 



gaa aga gat gtg ctt gga gga ttg aaa gtg aag aat gtg gtt acc gga 
Glu Arg Asp Val Leu Gly Gly Leu Lys Val Lys Asn Val Val Thr Gly 
230 235 240 



2277 



gat gtt tct gat tta aaa gtt tct gga ttg ttc ttt get att ggt cat 
Asp Val Ser Asp Leu Lys Val Ser Gly Leu Phe Phe Ala He Gly His 
245 250 255 



2325 



gag cca get acc aag ttt ttg gat ggt ggt gtt gag tta gat teg gat 
Glu Pro Ala Thr Lys Phe Leu Asp Gly Gly Val Glu Leu Asp Ser Asp 
260 ~ 265 ~ 270 



2373 



ggt tat gtt gtc acg aag cct ggt act aca cag act age gtt ccc gga 
Gly Tyr Val Val Thr Lys Pro Gly Thr Thr Gin Thr Ser Val Pro Gly 
275 280 285 



2421 



gtt ttc get gcg ggt gat gtt cag gat aag aag tat agg caa gec ate 
Val Phe Ala Ala Gly Asp Val Gin Asp Lys Lys Tyr Arg Gin Ala He 
290 " 295 300 305 



2469 



act get gca gga act ggg tgc atg gca get ttg gat gca gag cat tac 
Thr Ala Ala Gly Thr Gly Cys Met Ala Ala Leu Asp Ala Glu His Tyr 
310 315 320 



2517 



tta caa gag att gga tct cag caa ggt aag agt gat tga agcttaataa 
Leu Gin Glu He Gly Ser Gin Gin Gly Lys Ser Asp * 
325 330 



2566 



gtatgaacta 
accatgtaac 
gttatgatat 
ttattataaa 
atgtgtacta 
gttaagaaga 
acccacttat 
tgtatccatt 
tgtctttata 
ggtactattt 
ttaattttat 
aaggatttaa 
taatataaca 
ettgetggae 
taacaaatta 
ttaaattagg 
teaggteggg 
tgctgcataa 



aaatgcatgt 
agtataataa 
attaacactc 
tcatctgaat 
taagactttc 
cataacaatt 
gtattatatt 
tatatattat 
aggtttgatc 
gaactctctt 
tgettcttae 
aataataata 
tttatctata 
gactctcaat 
ttatttaaca 
agggacaatg 
gacaacaaaa 
tttatgcagt 



aggtgtaaga 
c t gage t cca 
tatctatgea 
cgtgacggct 
taaacaattc 
ataatggaag 
aggatgttaa 
atactaccca 
catgatattt 
actctgtata 
agataaaaaa 
aataataaat 
aaaaagtaaa 
tatttaaacg 
ctatatgaaa 
gtgtgtccca 
aaacaggcaa 
aaaacactac 



gctcatggag 
tctcacttct 
ccttattgtt 
tatggaatgc 
taactttagc 
aagtttgtct 
ggagacataa 
tttatatatt 
ctaatatttt 
aaggttggat 
aaaattatga 
aacatataat 
tattgtcata 
agagtaaaca 
tttttttttt 
atccttatac 
gggaaatttt 
acataaccct 



agcatggaat 
tctatgaata 
ctatgataaa 
ttcaaatagt 
attgtgaacg 
ccatttatat 
caattataaa 
atacttatcc 
agttgatatg 
catccttaaa 
gttggtttga 
atatgtatat 
aatctataca 
tatttgactt 
ttatcggcaa 
aaccaacttc 
ttaatttggg 
tttagcagta 



attgtatccg 
aacaaaggat 
tttcctctta 
acaaaaaeaa 
agacataagt 
attatatatt 
gagagaagtt 
acttatttaa 
tatatgaaag 
gtgggtctat 
taaaatattg 
aaatttatta 
at cgt tt age 
tttggttatt 
ggaaataaaa 
cacaggaagg 
ttgtcttgtt 
gagcaatggt 



2626 
2686 
2746 
2806 
2866 
2926 
2986 
3046 
3106 
3166 
3226 
3286 
3346 
3406 
3466 
3526 
3586 
3646 
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tgaccgtgtg cttagcttct tttattttat ttttttatca gcaaagaata aataaaataa 3 706 
aatgagacac ttcagggatg tttcaaccct tatacaaaac cccaaaaaca agtttcctag 3766 
caccctacca actaaggtac c 3787 

<210> 23 
<211> 333 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric 



<400> 23 



Met 


Asn Gly 


Leu Glu 


Thr 


His Asn 


Thr Arg 


Leu Cys He Val Gly 


Ser 


1 


5 






10 


15 




Gly 


Pro Ala 


Ala His 


Thr 


Ala Ala 


He Tyr 


Ala Ala Arg Ala Glu 


Leu 




20 






25 


30 




Lys 


Pro Leu 


Leu Phe 


Glu 


Gly Trp 


Met Ala 


Asn Asp He Ala Pro 


Gly 


35 






40 




45 




Gly 


Gin Leu 


Thr Thr 


Thr 


Thr Asp 


Val Glu 


Asn Phe Pro Gly Phe 


Pro 


50 






55 




60 




Glu 


Gly He 


Leu Gly 


Val 


Glu Leu 


Thr Asp 


Lys Phe Arg Lys Gin 


Ser 


65 






70 






75 


80 


Glu 


Arg Phe 


Gly Thr 


Thr 


He Phe 


Thr Glu 


Thr Val Thr Lys Val 


Asp 




85 






90 


95 




Phe 


Ser Ser 


Lys Pro 


Phe 


Lys Leu 


Phe Thr 


Asp Ser Lys Ala He 


Leu 






100 






105 


110 




Ala 


Asp Ala 


Val He 


Leu 


Ala Thr 


Gly Ala 


Val Ala Lys Arg Leu 


Ser 




115 






120 




125 




Phe 


Val Gly 


Ser Gly 


Glu 


Gly Ser 


Gly Gly 


Phe Trp Asn Arg Gly 


He 




130 






135 




140 




Ser 


Ala Cys 


Ala Val 


Cys 


Asp Gly 


Ala Ala 


Pro He Phe Arg Asn 


Lys 


145 






150 






155 


160 


Pro 


Leu Ala 


Val He 


Gly 


Gly Gly 


Asp Ser 


Ala Met Glu Glu Ala 


Asn 






165 






170 


175 




Phe 


Leu Thr 


Lys Tyr 


Gly 


Ser Lys 


Val Tyr 


He He His Arg Arg 


Asp 






180 






185 


190 




Ala 


Phe Arg 


Ala Ser 


Lys 


He Met 


Gin Gin 


Arg Ala Leu Ser Asn 


Pro 




195 




200 




205 




Lys 


He Asp 


Val He 


Trp 


Asn Ser 


Ser Val 


Val Glu Ala Tyr Gly 


Asp 




210 






215 




220 




Gly 


Glu Arg 


Asp Val 


Leu 


Gly Gly 


Leu Lys 


Val Lys Asn Val Val 


Thr 


225 






230 






235 


240 


Gly 


Asp Val 


Ser Asp 


Leu 


Lys Val 


Ser Gly 


Leu Phe Phe Ala He 


Gly 






245 






250 


255 




His 


Glu Pro 


Ala Thr 


Lys 


Phe Leu 


Asp Gly 


Gly Val Glu Leu Asp 


Ser 






260 






265 


270 




Asp 


Gly Tyr 


Val Val 


Thr 


Lys Pro 


Gly Thr 


Thr Gin Thr Ser Val 


Pro 




275 






280 




285 




Gly 


Val Phe 


Ala Ala 


Gly 


Asp Val 


Gin Asp 


Lys Lys Tyr Arg Gin 


Ala 




290 






295 




300 




He 


Thr Ala 


Ala Gly 


Thr 


Gly Cys 


Met Ala 


Ala Leu Asp Ala Glu 


His 


305 




310 




315 


320 


Tyr 


Leu Gin 


Glu He 


Gly 


Ser Gin 


Gin Gly 


Lys Ser Asp 








325 






330 







<210> 24 
<211> 4546 
<212> DMA 

<213> Artificial Sequence 

<220> 
<221> CDS 

<222> (1555) . . . (1907) 
<221> CDS 
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<222> (2148) . . . (3315) 
<223> Chimeric 
<400> 24 

ctgcaggaat tcattgtact cccagtatca ttatagtgaa agttttggct ctctcgccgg 60 
tggtttttta cctctattta aaggggtttt ccacctaaaa attctggtat cattctcact 120 
ttacttgtta ctttaatttc tcataatctt tggttgaaat tatcacgctt ccgcacacga 180 
tatccctaca aatttattat ttgttaaaca ttttcaaacc gcataaaatt ttatgaagtc 240 
ccgtctatct ttaatgtagt ctaacatttt catattgaaa tatataattt acttaatttt 300 
agcgttggta gaaagcataa tgatttattc ttattcttct tcatataaat gtttaatata 360 
caatataaac aaattcttta ccttaagaag gatttcccat tttatatttt aaaaatatat 420 
ttatcaaata tttttcaacc acgtaaatct cataataata agttgtttca aaagtaataa 480 
aatttaactc cataattttt ttattcgact gatcttaaag caacacccag tgacacaact 540 
agccattttt ttctttgaat aaaaaaatcc aattatcatt gtattttttt tatacaatga 600 
aaatttcacc aaacaatcat ttgtggtatt tctgaagcaa gtcatgttat gcaaaattct 660 
ataattccca tttgacacta cggaagtaac tgaagatctg cttttacatg cgagacacat 720 
cttctaaagt aattttaata atagttacta tattcaagat ttcatatatc aaatactcaa 780 
tattacttct aaaaaattaa ttagatataa ttaaaatatt acttttttaa ttttaagttt 840 
aattgttgaa tttgtgacta ttgatttatt attctactat gtttaaattg ttttatagat 900 
agtttaaagt aaatataagt aatgtagtag agtgttagag tgttacccta aaccataaac 960 
tataagattt atggtggact aattttcata tatttcttat tgcttttacc ttttcttggt 1020 
atgtaagtcc gtaactggaa ttactgtggg ttgccatggc actctgtggt cttttggttc 1080 
atgcatggat gcttgcgcaa gaaaaagaca aagaacaaag aaaaaagaca aaacagagag 1140 
acaaaacgca atcacacaac caactcaaat tagtcactgg ctgatcaaga tcgccgcgtc 1200 
catgtatgtc taaatgccat gcaaagcaac acgtgcttaa catgcacttt aaatggctca 1260 
cccatctcaa cccacacaca aacacattgc ctttttcttc atcatcacca caaccacctg 1320 
tatatattca ttctcttccg ccacctcaat ttcttcactt caacacacgt caacctgcat 1380 
atgcgtgtca tcccatgccc aaatctccat gcatgttcca accaccttct ctcttatata 1440 
atacctataa atacctctaa tatcactcac ttctttcatc atccatccat ccagagtact 1500 
actactctac tactataata ccccaaccca actcatattc aatactactc tact atg 1557 

Met 
1 

gcg gat aca get aga gga acc cat cac gat ate ate ggc aga gac cag 1605 
Ala Asp Thr Ala Arg Gly Thr His His Asp lie lie Gly Arg Asp Gin 
5 10 15 



tac ccg atg atg ggc cga gac cga gac cag tac cag atg tec gga cga 1653 
Tyr Pro Met Met Gly Arg Asp Arg Asp Gin Tyr Gin Met Ser Gly Arg 
20 25 30 

gga tct gac tac tec aag tct agg cag att get aaa get gca act get 1701 
Gly Ser Asp Tyr Ser Lys Ser Arg Gin lie Ala Lys Ala Ala Thr Ala 
35 40 45 

gtc aca get ggt ggt tec etc ctt gtt etc tec age ctt acc ctt gtt 1749 
Val Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Ser Leu Thr Leu Val 
50 55 60 65 

gga act gtc ata get ttg act gtt gca aca cct ctg etc gtt ate ttc 1797 
Gly Thr Val He Ala Leu Thr Val Ala Thr Pro Leu Leu Val He Phe 
70 75 80 

age cca ate ctt gtc ccg get etc ate aca gtt gca etc etc ate acc 1845 
Ser Pro He Leu Val Pro Ala Leu lie Thr Val Ala Leu Leu lie Thr 
85 90 95' 



ggt ttt ctt tec tct gga ggg ttt ggc att gee get ata acc gtt ttc 1893 
Gly Phe Leu Ser Ser Gly Gly Phe Gly He Ala Ala He Thr Val Phe 
100 105 110 

tct tgg att tac aa gtaagcacac atttatcatc ttacttcata attttgtgca 1947 
Ser Trp He Tyr Lys 
115 

atatgtgcat gcatgtgttg agecagtage tttggatcaa tttttttggt cgaataacaa 2007 
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atgtaacaat aagaaattgc aaattctagg gaacatttgg ttaactaaat acgaaatttg 2067 
acctagctag cttgaatgtg tctgtgtata tcatctatat aggtaaaatg cttggtatga 2127 
tacctattga ttgtgaatag g tac gca acg gga gag cac cca cag gga tea 2178 

Tyr Ala Thr Gly Glu His Pro Gin Gly Ser 
120 125 

gac aag ttg gac agt gca agg atg aag ttg gga age aaa get cag gat 2226 
Asp Lys Leu Asp Ser Ala Arg Met Lys Leu Gly Ser Lys Ala Gin Asp 
130 135 140 

ctg aaa gac aga get cag tac tac gga cag caa cat act ggt ggg gaa 2274 
Leu Lys Asp Arg Ala Gin Tyr Tyr Gly Gin Gin His Thr Gly Gly Glu 
145 150 155 160 

cat gac cgt gac cgt act cgt ggt ggc cag cac act ace atg aat ggt 2322 
His Asp Arg Asp Arg Thr Arg Gly Gly Gin His Thr Thr Met Asn Gly 
165 ~ 170 175 

etc gaa act cac aac aca agg etc tgt ate gta gga agt ggc cca gcg 2370 
Leu Glu Thr His Asn Thr Arg Leu Cys lie Val Gly Ser Gly Pro Ala 
180 185 190 

gca cac acg gcg gcg att tac gca get agg get gaa ctt aaa cct ctt 2418 
Ala His Thr Ala Ala lie Tyr Ala Ala Arg Ala Glu Leu Lys Pro Leu 
195 200 205 

etc ttc gaa gga tgg atg get aac gac ate get ccc ggt ggt caa, eta 2466 
Leu Phe Glu> Gly Trp Met Ala Asn Asp lie Ala Pro Gly Gly Gin Leu 
210 215 220 

aca acc ace ace gac gtc gag aat ttc ccc gga ttt cca gaa ggt att 2514 
Thr Thr Thr Thr Asp Val Glu Asn Phe Pro Gly- Phe Pro Glu Gly lie 
225 230 235 240 

etc gga gta gag etc act gac aaa ttc cgt aaa caa teg gag cga ttc 2562 
Leu Gly Val Glu Leu Thr Asp Lys Phe Arg Lys Gin Ser Glu Arg Phe 
245 250 255 

ggt act acg ata ttt aca gag acg gtg acg aaa gtc gat ttc tct teg 2610 
Gly Thr Thr lie Phe Thr Glu Thr Val Thr Lys Val Asp Phe Ser Ser 
260 265 270 

aaa ccg ttt aag eta ttc aca gat tea aaa gee att etc get gac get 2658 
Lys Pro Phe Lys Leu Phe Thr Asp Ser Lys Ala lie Leu Ala Asp Ala 
275 280 285 

gtg att etc get act gga get gtg get aag egg ctt age ttc gtt gga 2706 
Val He Leu Ala Thr Gly Ala Val Ala Lys Arg Leu Ser Phe Val Gly 
290 295 300 

tct ggt gaa ggt tct gga ggt ttc tgg aac cgt gga ate tec get tgt 2754 
Ser Gly Glu Gly Ser Gly Gly Phe Trp Asn Arg Gly He Ser Ala Cys 
305 310 315 320 

get gtt tgc gac gga get get ccg ata ttc c*gt aac aaa cct ctt gcg 2802 
Ala Val Cys Asp Gly Ala Ala Pro He Phe Arg Asn Lys Pro, Leu Ala 
325 330 335 

gtg ate ggt gga ggc gat tea gca atg gaa gaa gca aac ttt ctt aca 2850 
Val He Gly Gly Gly Asp Ser Ala Met Glu Glu Ala Asn Phe Leu Thr 
340 ~ 345 350 

aaa tat gga tct aaa gtg tat ata ate cat agg aga gat get ttt aga 2898 
Lys Tyr Gly Ser Lys Val Tyr He He His Arg Arg Asp Ala Phe Arg 
355 360 365 

gcg tct aag att atg cag cag cga get ttg tct aat cct aag att gat 2946 
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Ala Ser Lys lie Met Gin Gin Arg Ala Leu Ser Asn Pro Lys He Asp 
370 375 380 

gtg att tgg aac teg tct gtt gtg gaa get tat gga gat gga gaa aga 2994 
Val He Trp Asn Ser Ser Val Val Glu Ala Tyr Gly Asp Gly Glu Arg 
385 390 395 * 400 

gat gtg ctt gga gga ttg aaa gtg aag aat gtg gtt acc gga gat gtt 3042 
Asp Val Leu Gly Gly Leu Lys Val Lys Asn Val Val Thr Gly Asp Val 
405 410 415 

tct gat tta aaa gtt tct gga ttg ttc ttt get att ggt cat gag cca 3090 
Ser Asp Leu Lys Val Ser Gly Leu Phe Phe Ala He Gly His Glu Pro 
420 425 430 

get acc aag ttt ttg gat ggt ggt gtt gag tta gat teg gat ggt tat 3138 
Ala Thr Lys Phe Leu Asp Gly Gly Val Glu Leu Asp Ser Asp Gly Tyr 
435 440 445 

gtt gtc acg aag cct ggt act aca cag act age gtt ccc gga gtt ttc 3186 
Val Val Thr Lys Pro Gly Thr Thr Gin Thr Ser Val Pro Gly Val Phe 
450 455 460 

get gcg ggt gat gtt cag gat aag aag tat agg caa gee ate act get 3234 
Ala Ala Gly Asp Val Gin Asp Lys Lys Tyr Arg Gin Ala He Thr Ala 
465 470 475 480 

gca gga act ggg tgc atg gca get ttg gat gca gag cat tac tta caa 3282 
Ala Gly Thr Gly Cys Met Ala Ala Leu Asp Ala Glu His Tyr Leu Gin 
485 490 495 

gag att gga tct cag caa ggt aag agt gat tga agcttaataa gtatgaacta 3335 
Glu He Gly Ser Gin Gin Gly Lys Ser Asp * 
500 505 

aaatgcatgt aggtgtaaga gctcatggag agcatggaat attgtatccg accatgtaac 3395 
agtataataa ctgagctcca tctcacttct tctatgaata aacaaaggat gttatgatat 3455 
attaacactc tatctatgea ccttattgtt ctatgataaa tttcctctta ttattataaa 3515 
tcatctgaat cgtgacggct tatggaatgc ttcaaatagt acaaaaacaa atgtgtacta 3575 
taagactttc taaacaattc taactttagc attgtgaacg agacataagt gttaagaaga 3635 
cataacaatt ataatggaag aagtttgtct ccatttatat attatatatt acccacttat 3695 
gtattatatt aggatgttaa ggagacataa caattataaa gagagaagtt tgtatccatt 3755 
tatatattat atactaccca tttatatatt atacttatcc acttatttaa tgtctttata 3815 
aggtttgatc catgatattt ctaatatttt agttgatatg tatatgaaag ggtactattt 3875 
gaactctctt actctgtata aaggttggat catccttaaa gtgggtctat ttaattttat 3935 
tgettcttae agataaaaaa aaaattatga gttggtttga taaaatattg aaggatttaa 3995 
aataataata aataataaat aacatataat atatgtatat aaatttatta taatataaca 4055 
tttatctata aaaaagtaaa tattgtcata aatctataca ategtttage ettgetggae 4115 
gactctcaat tatttaaacg agagtaaaca tatttgactt tttggttatt taacaaatta 4175 
ttatttaaca ctatatgaaa tttttttttt ttatcggcaa ggaaataaaa ttaaattagg 4235 
agggacaatg gtgtgtccca atccttatac aaccaacttc cacaggaagg teaggteggg 4295 
gacaacaaaa aaacaggcaa gggaaatttt ttaatttggg ttgtcttgtt tgctgcataa 4355 
tttatgcagt aaaacactac acataaccct tttagcagta gagcaatggt tgaccgtgtg 4415 
cttagcttct tttattttat ttttttatca gcaaagaata aataaaataa aatgagacac 4475 
ttcagggatg tttcaaccct tatacaaaac cccaaaaaca agtttcctag caccctacca 4535 
actaaggtac c 4546 

<210> 25 
<211> 118 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric 
<400> 25 

Met Ala Asp Thr Ala Arg Gly Thr His His Asp He He Gly Arg Asp 
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1 

1 






b 






v?in 


Tyr Pro 




jwec 


c^iy 


Arg Asp 






20 








Arg 


Gly Ser 


Asp 


Tyr 


Ser 


Lys Ser 




35 








40 


Ala 


Val Tnr 


Ala 


Gly 


Gly 


Ser Leu 




50 








55 


val 


Gly Thr 


val 


lie 


TV "1 _ 

Ala 


Leu Thr 


65 






70 




Phe 


Ser Pro 


He 


Leu 


Val 


Pro Ala 








85 






Thr 


Gly Phe 


Leu 


Ser 


Ser 


Gly Gly 






100 






Phe 


Ser Trp 


He 


Tyr 


Lys 






115 











10 15 
Arg Asp Gin Tyr Gin Met Ser Gly 
25 30 
Arg Gin He Ala Lys Ala Ala Thr 
45 

Leu Val Leu Ser Ser Leu Thr Leu 
60 

Val Ala Thr Pro Leu Leu Val He 

75 80 
Leu He Thr Val Ala Leu Leu He 

90 95 
Phe Gly He Ala Ala He Thr Val 
105 110 



<210> 26 
<211> 388 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric 



<400> 26 



Tyr Ala 


Thr 


Gly 


Glu 


His 


Pro 


Gin Gly Ser Asp Lys 


Leu Asp Ser Ala 


1 






5 






10 


15 


Arg Met 


Lys 


Leu 


Gly 


Ser 


Lys 


Ala Gin Asp Leu Lys 


Asp Arg Ala Gin 






20 








25 


30 


Tyr Tyr 


Gly 


Gin 


Gin 


His 


Thr 


Gly Gly Glu His Asp 


Arg Asp Arg Thr 




35 










40 


45 


Arg Gly 


Gly 


Gin 


His 


Thr 


Thr 


Met Asn Gly Leu Glu 


Thr His Asn Thr 


50 










55 


60 






Cys 


XXc 




vjiy 


Ser 


Vj±y ±rrO AJ-ci Aid JtixS 


inr Aia /via lie 


65 








70 




75 


80 


Tyr Ala 


Ala 


Arg 


Ala 


Glu 


Leu 


Lys Pro Leu Leu Phe 


Glu Gly Trp Met 








85 






90 


95 


Ala Asn 


Asp 


He 


Ala 


Pro 


Gly 


Gly Gin Leu Thr Thr 


Thr Thr Asp Val 






100 








105 


110 


Glu Asn 


Phe 


Pro 


Gly 


Phe 


Pro 


Glu Gly lie Leu Gly 


Val Glu Leu Thr 




115 










120 


125 


Asp Lys 


Phe 


Arg 


Lys 


Gin 


Ser 


Glu Arg Phe Gly Thr 


Thr He Phe Thr 


130 










135 


140 




Glu Thr 


Val 


Thr 


Lys 


Val 


Asp 


Phe Ser Ser Lys Pro 


Phe Lys Leu Phe 


145 








150 




155 


160 


Thr Asp 


Ser 


Lys 


Ala 


He 


Leu 


Ala Asp Ala Val He 


Leu Ala Thr Gly 








165 






170 


175 


Ala Val 


Ala 


Lys 


Arg 


Leu 


Ser 


Phe Val Gly Ser Gly 


Glu Gly Ser Gly 






180 








185 


190 


Gly Phe 


Trp 


Asn 


Arg 


Gly 


He 


Ser Ala Cys Ala Val 


Cys Asp Gly Ala 




195 










200 


205 


Ala Pro 


He 


Phe 


Arg 


Asn 


Lys 


Pro Leu Ala Val He 


Gly Gly Gly Asp 


210 










215 


220 


Ser Ala 


Met 


Glu 


Glu 


Ala 


Asn 


Phe Leu Thr Lys Tyr 


Gly Ser Lys Val 


225 








230 




235 


240 


Tyr He 


He 


His 


Arg 


Arg 


Asp 


Ala Phe Arg Ala Ser 


Lys He Met Gin 








245 






250 


255 


(Jin Arg 


Ala 


Leu 


Ser 


Asn 


Pro 


Lys He Asp Val He 


Trp Asn Ser Ser 






260 








265 


270 


Val Val 


Glu 


Ala 


Tyr 


Gly 


Asp 


Gly Glu Arg Asp Val 


Leu Gly Gly Leu 




275 










280 


285 


Lys Val 


Lys 


Asn 


Val 


Val 


Thr 


Gly Asp Val Ser Asp 


Leu Lys Val Ser 


290 










295 


300 




Gly Leu 


Phe 


Phe 


Ala 


He 


Gly 


His Glu Pro Ala Thr 


Lys Phe Leu Asp 


305 








310 




315 


320 


Gly Gly 


Val 


Glu 


Leu 


Asp 


Ser 


Asp Gly Tyr Val Val 


Thr Lys Pro Gly 
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325 330 335 

Thr Thr Gin Thr Ser Val Pro Gly Val Phe Ala Ala Gly Asp Val Gin 

340 345 350 

Asp Lys Lys Tyr Arg Gin Ala lie Thr Ala Ala Gly Thr Gly Cys Met 

355 360 365 

Ala Ala Leu Asp Ala Glu His Tyr Leu Gin Glu lie Gly Ser Gin Gin 

370 375 380 

Gly Lys Ser Asp 
385 



<210> 27 
<211> 4545 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Chimeric 
<221> CDS 

<222> (1555) . - . (2906) 
<221> CDS 

<222> (3147) . . . (3315) 
<400> 27 

ctgcaggaat tcattgtact cccagtatca ttatagtgaa agttttggct ctctcgccgg 60 
tggtttttta cctctattta aaggggtttt ccacctaaaa attctggtat cattctcact 120 
ttacttgtta ctttaatttc tcataatctt tggttgaaat tatcacgctt ccgcacacga 180 
tatccctaca aatttattat ttgttaaaca ttttcaaacc gcataaaatt ttatgaagtc 240 
ccgtctatct ttaatgtagt ctaacatttt catattgaaa tatataattt acttaatttt 300 
agcgttggta gaaagcataa tgatttattc ttattcttct tcatataaat gtttaatata 360 
caatataaac aaattcttta ccttaagaag gatttcccat tttatatttt aaaaatatat 420 
ttatcaaata tttttcaacc acgtaaatct cataataata agttgtttca aaagtaataa 480 
aatttaactc cataattttt ttattcgact gatcttaaag caacacccag tgacacaact 540 
agccattttt ttctttgaat aaaaaaatcc aattatcatt gtattttttt tatacaatga 600 
aaatttcacc aaacaatcat ttgtggtatt tctgaagcaa gtcatgttat gcaaaattct 660 
ataattccca tttgacacta cggaagtaac tgaagatctg cttttacatg cgagacacat 720 
cttctaaagt aattttaata atagttacta tattcaagat ttcatatatc aaatactcaa 780 
tattacttct aaaaaattaa ttagatataa ttaaaatatt acttttttaa ttttaagttt 840 
aattgttgaa tttgtgacta ttgatttatt attctactat gtttaaattg ttttatagat 900 
agtttaaagt aaatataagt aatgtagtag agtgttagag tgttacccta aaccataaac 960 
tataagattt atggtggact aattttcata tatttcttat tgcttttacc ttttcttggt 1020 
atgtaagtcc gtaactggaa ttactgtggg ttgccatggc actctgtggt cttttggttc 1080 
atgcatggat gcttgcgcaa gaaaaagaca aagaacaaag aaaaaagaca aaacagagag 1140 
acaaaacgca atcacacaac caactcaaat tagtcactgg ctgatcaaga tcgccgcgtc 1200 
catgtatgtc taaatgccat gcaaagcaac acgtgcttaa catgcacttt aaatggctca 1260 
cccatctcaa cccacacaca aacacattgc ctttttcttc atcatcacca caaccacctg 1320 
tatatattca ttctcttccg ccacctcaat ttcttcactt caacacacgt caacctgcat 1380 
atgcgtgtca tcccatgccc aaatctccat gcatgttcca accaccttct ctcttatata 1440 
atacctataa atacctctaa tatcactcac ttctttcatc atccatccat ccagagtact 1500 
actactctac tactataata ccccaaccca actcatattc aatactactc tact atg 1557 

Met 
1 

aat ggt etc gaa act cac aac aca agg etc tgt ate gta gga, agt ggc 1605 
Asn Gly Leu Glu Thr His Asn Thr Arg Leu Cys lie Val Gly Ser Gly 
5 10 15 

cca gcg gca cac acg gcg gcg att tac gca get agg get gaa ctt aaa 1653 
Pro Ala Ala His Thr Ala Ala lie Tyr Ala Ala Arg Ala Glu Leu Lys 
20 25 30 

cct ctt etc ttc gaa gga tgg atg get aac gac ate get ccc ggt ggt 1701 
Pro Leu Leu Phe Glu Gly Trp Met Ala Asn Asp He Ala Pro Gly Gly 
35 40 45 
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caa eta aca acc acc acc gac gtc gag aat ttc ccc gga ttt cca gaa 1749 
Gin Leu Thr Thr Thr Thr Asp Val Glu Asn Phe Pro Gly Phe Pro Glu 
50 55 60 65 

ggt att etc gga gta gag etc act gac aaa ttc cgt aaa caa teg gag " 1797 
Gly He Leu Gly Val Glu Leu Thr Asp Lys Phe Arg Lys Gin Ser Glu 
70 75 80 

cga ttc ggt act acg ata ttt aca gag acg gtg acg aaa gtc gat ttc 1845 
Arg Phe Gly Thr Thr He Phe Thr Glu Thr Val Thr Lys Val Asp Phe 
85 90 95 

tct teg aaa ccg ttt aag eta ttc aca gat tea aaa gee att etc get 1893 
Ser Ser Lys Pro Phe Lys Leu Phe Thr Asp Ser Lys Ala He Leu Ala 
100 105 110 

gac get gtg att etc get act gga get gtg get aag egg ctt age ttc 1941 
Asp Ala Val He Leu Ala Thr Gly Ala Val Ala Lys Arg Leu Ser Phe 
115 120 ' 125 

gtt gga tct ggt gaa ggt tct gga ggt ttc tgg aac cgt gga ate tec 1989 
Val Gly Ser Gly Glu Gly Ser Gly Gly Phe Trp Asn Arg Gly He Ser 
130 135 140 145 

get tgt get gtt tgc gac gga get get ccg ata ttc cgt aac aaa cct 2037 
Ala Cys Ala Val Cys Asp Gly Ala Ala Pro He Phe Arg Asn Lys Pro 
150 155 160 

ctt gcg gtg ate ggt gga ggc gat tea gca atg gaa gaa gca aac ttt 2085 
Leu Ala Val He Gly Gly Gly Asp Ser Ala Met Glu Glu Ala Asn Phe 
165 170 175 

ctt aca aaa tat gga tct aaa gtg tat ata ate cat agg aga gat get 2133 
Leu Thr Lys Tyr Gly Ser Lys Val Tyr He He His Arg Arg Asp Ala 
180 185 19 N 0 

ttt aga gcg tct aag att atg cag cag cga get ttg tct aat cct aag 2181 
Phe Arg Ala Ser Lys He Met Gin Gin Arg Ala Leu Ser Asn Pro Lys 
195 * 200 205 

att gat gtg att tgg aac teg tct gtt gtg gaa get tat gga gat gga 2229 
He Asp Val He Trp Asn Ser Ser Val Val Glu Ala Tyr Gly Asp Gly 
210 215 220 225 

gaa aga gat gtg ctt gga gga ttg aaa gtg aag aat gtg gtt acc gga 2277 
Glu Arg Asp Val Leu Gly Gly Leu Lys Val Lys Asn Val Val Thr Gly 
230 ' 235 " 240 

gat gtt tct gat tta aaa gtt tct gga ttg ttc ttt get att ggt cat 2325 
Asp Val Ser Asp Leu Lys Val Ser Gly Leu Phe Phe Ala He Gly His 
245 250 255 

gag cca get acc aag ttt ttg gat ggt ggt gtt gag tta gat teg gat 2373 
Glu Pro Ala Thr Lys Phe Leu Asp Gly Gly Val Glu Leu Asp Ser Asp 
260 265 270 

ggt tat gtt gtc acg aag cct ggt act aca cag act age gtt ccc gga 2421 
Gly Tyr Val Val Thr Lys Pro Gly Thr Thr Gin Thr Ser Val Pro Gly 
275 280 285 

gtt ttc get gcg ggt gat gtt cag gat aag aag tat agg caa gee ate 2469 
Val Phe Ala Ala Gly Asp Val Gin Asp Lys Lys Tyr Arg Gin Ala He 
290 295 " * 300 305 

act get gca gga act ggg tgc atg gca get ttg gat gca gag cat tac 2517 
Thr Ala Ala Gly Thr Gly Cys Met Ala Ala Leu Asp Ala Glu His Tyr 
310 315 320 
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tta caa gag att gga tct cag caa ggt aag agt gat atg gcg gat aca 2565 
Leu Gin Glu lie Gly Ser Gin Gin Gly Lys Ser Asp Met Ala Asp Thr 
325 330 335 

get aga gga acc cat cac gat ate ate ggc aga gac cag tac ccg atg 2613 
Ala Arg Gly Thr His His Asp lie lie Gly Arg Asp Gin Tyr Pro Met 
340 345 350 

a tg ggc cga gac cga gac cag tac cag atg tec gga cga gga tct gac 2661 
Met Gly Arg Asp Arg Asp Gin Tyr Gin Met Ser Gly Arg Gly Ser Asp 
355 360 365 

tac tec aag tct agg cag att get aaa get gca act get gtc aca get 2709 
Tyr Ser Lys Ser Arg Gin lie Ala Lys Ala Ala Thr Ala Val Thr Ala 
370 375 380 385 

99t ggt tec etc ctt gtt etc tec age ctt acc ctt gtt gga act gtc 2757 
Gly Gly Ser Leu Leu Val Leu Ser Ser Leu Thr Leu Val Gly Thr Val 
390 395 " 400 

at a get ttg act gtt gca aca cct ctg etc gtt ate ttc age cca ate 2805 
lie Ala Leu Thr Val Ala Thr Pro Leu Leu Val lie Phe Ser Pro lie 
405 410 415 

ctt gtc ccg get etc ate aca gtt gca etc etc ate acc ggt ttt ctt 2853 
Leu Val Pro Ala Leu lie Thr Val Ala Leu Leu lie Thr Gly Phe Leu 
420 425 430 

tec tct gga ggg ttt ggc att gee get ata acc gtt ttc tct tgg att 2901 
Ser Ser Gly Gly Phe Gly lie Ala Ala lie Thr Val Phe Ser Trp He 
435 440 445 

tac aa gtaagcacac atttatcatc ttacttcata attttgtgca atatgtgcat 2956 

Tyr Lys 

450 

gcatgtgttg agecagtage tttggatcaa tttttttggt cgaataacaa atgtaacaat 3016 
aagaaattgc aaattctagg gaacatttgg ttaactaaat acgaaatttg acctagctag 3076 
cttgaatgtg tctgtgtata tcatctatat aggtaaaatg cttggtatga tacctattga 3136 
ttgtgaatag g tac gca acg gga gag cac cca cag gga tea gac aag ttg 3186 
Tyr Ala Thr Gly Glu His Pro Gin Gly Ser Asp Lys Leu 
455 460 

gac agt gca agg atg aag ttg gga age aaa get cag gat ctg aaa gac 3234 
Asp Ser Ala Arg Met Lys Leu Gly Ser Lys Ala Gin Asp Leu Lys Asp 
465 470 * 475 480 

aga get cag tac tac gga cag caa cat act ggt ggg gaa cat gac cgt 3282 
Arg Ala Gin Tyr Tyr Gly Gin Gin His Thr Gly Gly Glu His Asp Arg 
485 490 495 

gac cgt act cgt ggt ggc cag cac act act taa gcttaataag tatgaactaa 3335 
Asp Arg Thr Arg Gly Gly Gin His Thr Thr * 
500 ~ 505 

aatgcatgta ggtgtaagag ctcatggaga gcatggaata ttgtatccga ccatgtaaca 3395 
gtataataac tgagctccat ctcacttctt ctatgaataa acaaaggatg ttatgatata 3455 
ttaacactct atetatgeae cttattgttc tatgataaat ttcctcttat tattataaat 3515 
catctgaatc gtgaeggett atggaatgct tcaaatagta caaaaacaaa tgtgtactat 3575 
aagactttct aaacaattct aactttagca ttgtgaacga gacataagtg ttaagaagac 3635 
ataacaatta taatggaaga agtttgtctc catttatata ttatatatta cccacttatg 3695 
tattatatta ggatgttaag gagacataac aattataaag agagaagttt gtatccattt 3755 
atatattata tactacccat ttatatatta tacttatcca cttatttaat gtctttataa 3815 
ggtttgatcc atgatatttc taatatttta gttgatatgt atatgaaagg gtactatttg 3875 
aactctctta ctctgtataa aggttggatc atccttaaag tgggtctatt taattttatt 3935 
gcttcttaca gataaaaaaa aaattatgag ttggtttgat aaaatattga aggatttaaa 3995 
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ataataataa ataataaata acatataata 
ttatctataa aaaagtaaat attgtcataa 
actctcaatt atttaaacga gagtaaacat 
tatttaacac tatatgaaat tttttttttt 
gggacaatgg tgtgtcccaa tccttataca 
acaacaaaaa aacaggcaag ggaaattttt 
ttatgcagta aaacactaca cataaccctt 
ttagcttctt ttafctttatt tttttatcag 
tcagggatgt ttcaaccctt atacaaaacc 
ctaaggtacc 

<210> 28 
<211> 451 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric 



tatgtatata aatttattat aatataacat 4055 

atctatacaa tcgtttagcc ttgctggacg 4115 

atttgacttt ttggttattt aacaaattat 4175 

tatcggcaag gaaataaaat taaattagga 4235 

accaacttcc acaggaaggt caggtcgggg 4295 

taatttgggt tgtcttgttt gctgcataat 4355 

ttagcagtag agcaatggtt gaccgtgtgc 4415 

caaagaataa ataaaataaa atgagacact 4475 

ccaaaaacaa gtttcctagc accctaccaa 4535 

4545 



<400> 28 



Met 


Asn 


Gly 


Leu 


Glu 


Thr 


His 


Asn 


Thr Arg Leu 


Cys 


He 


Val 


Gly 


Ser 


1 






5 








10 








15 




Gly 


Pro 


Ala 


Ala 


His 


Thr 


Ala 


Ala 


He Tyr Ala 


Ala 


Arg 


Ala 


Glu 


Leu 






20 










25 






30 






Lys 


Pro 


Leu 


Leu 


Phe 


Glu 


Gly 


Trp 


Met Ala Asn 


Asp 


He 


Ala 


Pro 


Gly 




35 








40 






45 








Gly 


Gin 


Leu 


Thr 


Thr 


Thr 


Thr 


Asp 


Val Glu Asn 


Phe 


Pro 


Gly 


Phe 


Pro 


50 










55 






60 










Glu 


Gly 


He 


Leu 


Gly 


Val 


Glu 


Leu 


Thr Asp Lys 


Phe 


Arg 


Lys 


Gin 


Ser 


65 








70 






75 










80 


Glu 


Arg 


Phe 


Gly 


Thr 


Thr 


He 


Phe 


Thr Glu Thr 


Val 


Thr 


Lys 


Val 


Asp 






85 








90 








95 




Phe 


Ser 


Ser 


Lys 


Pro 


Phe 


Lys 


Leu 


Phe Thr Asp 


Ser 


Lys 


Ala 


He 


Leu 








100 










105 






110 






Ala 


Asp 


Ala 


Val 


He 


Leu 


Ala 


Thr 


Gly Ala Val 


Ala 


Lys 


Arg 


Leu 


Ser 






115 










120 






125 








Phe 


Val 


Gly 


Ser 


Gly 


Glu 


Gly 


Ser 


Gly Gly Phe 


Trp 


Asn 


Arg 


Gly 


lie 




130 










135 






140 










Ser 


Ala 


Cys 


Ala 


Val 


Cys 


Asp 


Gly 


Ala Ala Pro 


He 


Til, ^. 

Pne 


Arg 


Asn 


Lys 


145 










150 






155 










160 


Pro 


Leu 


.a. -La 


Tjr- 1 

vaj. 


j. j.e 


taiy 


fll *r 

vjr±y 




i\sp oci /us 


wet 


V3-LU 


ni ii 


-H-i. ci 


Asn 










165 




170 


lie 






175 




Phe 


Leu 


Thr 


Lys 


Tyr 


Gly 


Ser 


Lys 


Val Tyr He 


His 


Arg 


Arg 


Asp 








180 










185 






190 






Ala 


Phe 


Arg 


Ala 


Ser 


Lys 


He 


Met 


Gin Gin Arg 


Ala 


Leu 


Ser 


Asn 


Pro 






195 








200 




205 








Lys 


He 


Asp 


Val 


He 


Trp 


Asn 


Ser 


Ser Val Val 


Glu 


Ala 


Tyr 


Gly 


Asp 




210 










215 






220 










Gly 


Glu 


Arg 


Asp 


Val 


Leu 


Gly 


Gly 


Leu Lys Val 


Lys 


Asn 


Val 


Val 


Thr 


225 










230 






235 










240 


Gly 


Asp 


Val 


Ser 


Asp 


Leu 


Lys 


Val 


Ser Gly Leu 


Phe 


Phe 


Ala 


He 


Gly 










245 








250 








255 




His 


Glu 


Pro 


Ala 


Thr 


Lys 


Phe 


Leu 


Asp Gly Gly 


Val 


Glu 


Leu 


Asp 


Ser 








260 








265 






270 






Asp 


Gly 


Tyr 


Val 


Val 


Thr 


Lys 


Pro 


Gly Thr Thr 


Gin 


Thr 


Ser 


Val 


Pro 






275 










280 






285 








Gly 


Val 


Phe 


Ala 


Ala 


Gly 


Asp 


Val 


Gin Asp Lys 


Lys 


Tyr 


Arg 


Gin 


Ala 




290 










295 






300 










lie 


Thr 


Ala 


Ala 


Gly 


Thr 


Gly 


Cys 


Met Ala Ala 


Leu 


Asp 


Ala 


Glu 


His 


305 










310 






315 










320 


Tyr 


Leu 


Gin 


Glu 


He 


Gly 


Ser 


Gin 


Gin Gly Lys 


Ser 


Asp 


Met 


Ala 


Asp 










325 








330 








335 




Thr 


Ala 


Arg 


Gly 


Thr 


His 


His 


Asp 


He He Gly 


Arg 


Asp 


Gin 


Tyr 


Pro 








340 










345 






350 






Met 


Met 


Gly 


Arg 


Asp 


Arg 


Asp 


Gin 


Tyr Gin Met 


Ser 


Gly 


Arg 


Gly 


Ser 






355 










360 






365 








Asp 


Tyr 


Ser 


Lys 


Ser 


Arg 


Gin 


He 


Ala Lys Ala 


Ala 


Thr 


Ala 


Val 


Thr 
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370 375 380 

Ala Gly Gly Ser Leu Leu Val Leu Ser Ser Leu Thr Leu Val Gly Thr 
385 " 390 395 400 

Val lie Ala Leu Thr Val Ala Thr Pro Leu Leu Val lie Phe Ser Pro 

405 410 • 415 

lie Leu Val Pro Ala Leu He Thr Val Ala Leu Leu He Thr Gly Phe 

420 425 430 

Leu Ser Ser Gly Gly Phe Gly He Ala Ala He Thr Val Phe Ser Trp 

435 440 445 

He Tyr Lys 
450 



<210> 29 
<211> 55 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric 
<400> 29 

Tyr Ala Thr Gly Glu His Pro Gin Gly Ser Asp Lys Leu Asp Ser Ala 

15 10 15 

Arg Met Lys Leu Gly Ser Lys Ala Gin Asp Leu Lys Asp Arg Ala Gin 

20 25 30 

Tyr Tyr Gly Gin Gin His Thr Gly Gly Glu His Asp Arg Asp Arg Thr 

35 40 45 

Arg Gly Gly Gin His Thr Thr 
50 55 



<210> 30 
<211> 4922 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

<222> (1555) . . . (1907) 
<221> CDS 

<222> (2148) . . . (3690) 
<223> Chimeric 
<400> 30 

ctgcaggaat tcat'tgtact cccagtatca 
tggtttttta cctctattta aagg'ggtttt 
ttacttgtta ctttaatttc tcataatctt 
tatccctaca aatttattat ttgttaaaca 
ccgtctatct ttaatgtagt ctaacatttt 
agcgttggta gaaagcataa tgatttattc 
caatataaac aaattcttta ccttaagaag 
ttatcaaata tttttcaacc acgtaaatct 
aatttaactc cataattttt ttattcgact 
agccattttt ttctttgaat aaaaaaatcc 
aaatttcacc aaacaatcat ttgtggtatt 
ataattccca tttgacacta cggaagtaac 
cttctaaagt aattttaata atagttacta 
tattacttct aaaaaattaa ttagatataa 
aattgttgaa tttgtgacta ttgatttatt 
agtttaaagt aaatataagt aatgtagtag 
tataagattt atggtggact aattttcata 
atgtaagtcc gtaactggaa ttactgtggg 
atgcatggat gcttgcgcaa gaaaaagaca 
acaaaacgca atcacacaac caactcaaat 



ttatagtgaa agttttggct ctctcgccgg 60 
ccacctaaaa attctggtat cattctcact 120 
tggttgaaat tatcacgctt ccgcacacga 180 
ttttcaaacc gcataaaatt ttatgaagtc 240 
catattgaaa tatataattt acttaatttt 300 
ttattcttct tcatataaat gtttaatata 360 
gatttcccat tttatatttt aaaaatatat 420 
cataataata agttgtttca aaagtaataa 480 
gatcttaaag caacacccag tgacacaact 540 
aattatcatt gtattttttt tatacaatga 600 
tctgaagcaa gtcatgttat gcaaaattct 660 
tgaagatctg cttttacatg cgagacacat 720 
tattcaagat ttcatatatc aaatactcaa 780 
ttaaaatatt acttttttaa ttttaagttt 840 
attctactat gtttaaattg ttttatagat 900 
agtgttagag tgttacccta aaccataaac 960 
tatttcttat tgcttttacc ttttcttggt 1020 
ttgccatggc actctgtggt cttttggttc 1080 
aagaacaaag aaaaaagaca aaacagagag 1140 
tagtcactgg ctgatcaaga tcgccgcgtc 1200 



-26- 



WO 02/50289 



PCT/US01/50240 



catgtatgtc 
cccatctcaa 
tatatattca 
atgcgtgtca 
atacctataa 
actactctac 



taaatgccat 
cccacacaca 
ttctcttccg 
tcccatgccc 
atacctctaa 
tactataata 



gcaaagcaac 
aacacattgc 
ccacctcaat 
aaatctccat 
tatcactcac 
ccccaaccca 



acgtgcttaa 
ctttttcttc 
ttcttcactt 
gcatgttcca 
ttctttcatc 
actcatattc 



catgcacttt 
atcatcacca 
caacacacgt 
accaccttct 
atccatccat 
aatactactc 



aaatggctca 1260 
caaccacctg 1320 
caacctgcat 1380 
ctcttatata 1440 
ccagagtact 1500 
tact atg 1557 
Met 
1 



gcg gat aca get aga gga acc cat cac gat ate ate ggc aga gac cag 
Ala Asp Thr Ala Arg Gly Thr His His Asp He He Gly Arg Asp Gin 
5 10 15 



1605 



tac ccg atg atg ggc cga gac cga gac cag tac cag atg tec gga cga 
Tyr Pro Met Met Gly Arg Asp Arg Asp Gin Tyr Gin Met Ser Gly Arg 
20 25 30 



1653 



gga tct gac tac tec aag tct agg cag att get aaa get gca act get 
Gly Ser Asp Tyr Ser Lys Ser Arg Gin He Ala Lys Ala Ala Thr Ala 
35 40 45 



1701 



gtc aca get ggt ggt tec etc ctt gtt etc tec age ctt acc ctt gtt 
Val Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Ser Leu Thr Leu Val 
50 55 60 65 



1749 



gga act gtc ata get ttg act gtt gca aca cct ctg etc gtt ate ttc 
Gly Thr Val He Ala Leu Thr Val Ala Thr Pro Leu Leu Val He Phe 
70 75 80 



1797 



age cca ate ctt gtc ccg get etc ate aca gtt gca etc etc ate acc 
Ser Pro He Leu Val Pro Ala Leu He Thr Val Ala Leu Leu He Thr 
85 90 95 



1845 



ggt ttt ctt tec tct gga ggg ttt ggc att gee get ata acc gtt ttc 
Gly Phe Leu Ser Ser Gly Gly Phe Gly He Ala Ala He Thr Val Phe 
100 105 110 



1893 



tct tgg att tac aa gtaagcacac atttatcatc ttacttcata attttgtgca 
Ser Trp He Tyr Lys 
115 



1947 



atatgtgcat gcatgtgttg agecagtage tttggatcaa tttttttggt cgaataacaa 2007 
atgtaacaat aagaaattgc aaattctagg gaacatttgg ttaactaaat acgaaatttg 2067 
acctagctag cttgaatgtg tctgtgtata tcatctatat aggtaaaatg cttggtatga 2127 
tacctattga ttgtgaatag g tac gca acg gga gag cac cca cag gga tea 2178 

Tyr Ala Thr Gly Glu His Pro Gin Gly Ser 
120 125 



gac aag ttg gac agt gca agg atg aag ttg gga age aaa get cag gat 2226 
Asp Lys Leu Asp Ser Ala Arg Met Lys Leu Gly Ser Lys Ala Gin Asp 
130 ~ 135 140 

ctg aaa gac aga get cag tac tac gga cag caa cat act ggt ggg gaa 2274 
Leu Lys Asp Arg Ala Gin Tyr Tyr Gly Gin Gin His Thr Gly Gly Glu 
145 " 150 ~ 155 160 

cat gac cgt gac cgt act cgt ggt ggc cag cac act acc atg aac acc 2322 
His Asp Arg Asp Arg Thr Arg Gly Gly Gin His Thr Thr Met Asn Thr 
165 170 175 

act cct tct gcg cat gag acg ata cac gaa gtg ate gtt att ggc tec 2370 
Thr Pro Ser Ala His Glu Thr He His Glu Val He Val He Gly Ser 
180 185 190 

ggt cca gca ggc tac act get gee ctg tac gee get cgt gca cag eta 2418 
Gly Pro Ala Gly Tyr Thr Ala Ala Leu Tyr Ala Ala Arg Ala Gin Leu 
195 200 205 
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aca ccg ctg gta ttt gag ggt acc tea ttc ggc ggc gcg ctg atg acc 2466 
Thr Pro Leu Val Phe Glu Gly Thr Ser Phe Gly Gly Ala Leu Met Thr 
210 215 220 

acc acc gag gtg gaa aac tac cca ggt ttt cgc aac ggc ata acc ggc 2514 
Thr Thr Glu Val Glu Asn Tyr Pro Gly Phe Arg Asn Gly lie Thr Gly 
225 230 235 240 

ccg gag ttg atg gac gat atg cgt gaa cag gca ctg cga ttc ggc gcg 2562 
Pro Glu Leu Met Asp Asp Met Arg Glu Gin Ala Leu Arg Phe Gly Ala 
245 250 255 

gaa ctg egg acc gaa gac gtc gag teg gta tea ttg cgt ggc ccg ate , 2610 
Glu Leu Arg Thr Glu Asp Val Glu Ser Val Ser Leu Arg Gly Pro lie 
260 265 270 

aaa teg gtc gtc acc get gaa gga cag act tat cag gee cga gee gtc 2658 
Lys Ser Val Val Thr Ala Glu Gly Gin Thr Tyr Gin Ala Arg Ala Val 
275 280 285 

ate etc gee atg ggt acc tec gtg cgt tat eta cag ate ccc ggc gag 2706 
lie Leu Ala Met Gly Thr Ser Val Arg Tyr Leu Gin lie Pro Gly Glu 
290 295 300 

caa gaa ttg eta gga cgt ggc gtg agt gca tgc gcg acc tgc gac ggg 2754 
Gin Glu Leu Leu Gly Arg Gly Val Ser Ala Cys Ala Thr Cys Asp Gly 
305 310 315 320 

tec ttt ttc cgc ggc caa gac att gec gtc att ggc ggt gga gac tea 2802 
Ser Phe Phe Arg Gly Gin Asp lie Ala Val lie Gly Gly Gly Asp Ser 
325 ~ 330 335 

gcg atg gag gaa gee etc ttt ttg acc egg ttc gee cgc age gtc acg 2850 
Ala Met Glu Glu Ala Leu Phe Leu Thr Arg Phe Ala Arg Ser Val Thr 
340 345 350 

etc gtg cac cgc cgc gac gaa ttc cga get tct aag ate atg etc ggt 2898 
Leu Val His Arg Arg Asp Glu Phe Arg Ala Ser Lys lie Met Leu Gly 
355 360 365 

cgc gee cgt aac aat gac aag ate aaa ttc ate acc aac cac acc gtg 2946 
Arg Ala Arg Asn Asn Asp Lys lie Lys Phe lie Thr Asn His Thr Val 
370 375 380 

gtc gcg gtg aac ggg tat aca aca gtg acc gga ttg egg ttg cgt aac 2994 
Val Ala Val Asn Gly Tyr Thr Thr Val Thr Gly Leu Arg Leu Arg Asn 
385 390 395 400 

acc aca acg gga gag gaa acc acg eta gta gtg acc ggg gtt ttt gtt 3042 
Thr Thr Thr Gly Glu Glu Thr Thr Leu Val Val Thr Gly Val Phe Val 
405 410 415 

gca att ggc cat gaa cca cgt tec age ctg gtg age gat gtc gtc gac 3090 
Ala He Gly His Glu Pro Arg Ser Ser Leu Val Ser Asp Val Val Asp 
420 425 430, 

ata gac ccg gat ggc tac gtc ctg gtg aaa gga cgt acg acg agt aca 3138 
He Asp Pro Asp Gly Tyr Val Leu Val Lys Gly Arg Thr Thr Ser Thr 
435 440 445 

teg atg gac ggc gtt ttt gcg gec ggc gac ctg gta gat cgc acc tac 3186 
Ser Met Asp Gly Val Phe Ala Ala Gly Asp Leu Val Asp Arg Thr Tyr 
450 455 460 

egg cag gcg ate act gee gca ggt agt ggc tgt gec gec gec ate gac 3*234 
Arg Gin Ala He Thr Ala Ala Gly Ser Gly Cys Ala Ala Ala He Asp 
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465 



470 



475 



480 



gcc gaa cgt tgg ttg gcg gag cat gcc ggg tea aaa get aac gaa aca 
Ala Glu Arg Trp Leu Ala Glu His Ala Gly Ser Lys Ala Asn Glu Thr 
485 490 495 



3282 



aca gag gaa act gga gac gtt gac agt acc gac aca acc gat tgg age 
Thr Glu Glu Thr Gly Asp Val Asp Ser Thr Asp Thr Thr Asp Trp Ser 
500 505 510 



3330 



act gcg atg act gac gcc aag aac gcc ggg gtc aca ata gaa gtg acc 
Thr Ala Met Thr Asp Ala Lys Asn Ala Gly Val Thr lie Glu Val Thr 
515 520 525 



3378 



gat get tec ttt ttc gca gac gtc tta tec agt aat aag cct gtg tta 
Asp Ala Ser Phe Phe Ala Asp Val Leu Ser Ser Asn Lys Pro Val Leu 
530 535 540 



3426 



gtt gat ttt tgg gca aca tgg tgt gga ccc tgc aag atg gta gcg ccg 
Val Asp Phe Trp Ala Thr Trp Cys Gly Pro Cys Lys Met Val Ala Pro 
545 550 555 560 



3474 



gta etc gaa gag ate gcg tec gaa caa cga aac cag etc act gtc gcc 
Val Leu Glu Glu lie Ala Ser Glu Gin Arg Asn Gin Leu Thr Val Ala 
565 570 575 



3522 



aag tta gat gta gac acc aac. ccg gaa atg gca cgc gag ttc cag gtc 
Lys Leu Asp Val Asp Thr Asn Pro Glu Met Ala Arg Glu Phe Gin Val 
580 . 585 590 



3570 



gtg teg ata ccc aca atg att ctg ttc cag ggt ggc caa cca gta aaa 
Val Ser lie Pro Thr Met He Leu Phe Gin Gly Gly Gin Pro Val Lys 
595 600 605 



3618 



cgc ate gtt ggc get aag ggc aaa gca gcg tta eta cgt gac ctt tec 
Arg He Val Gly Ala Lys Gly Lys Ala Ala Leu Leu Arg Asp Leu Ser 
610 615 620 



3666 



gac gtg gta cct aac etc aat taa gctttaaata agtatgaact aaaatgcatg 
Asp Val Val Pro Asn Leu Asn * 
625 630 



3720 



taggtgtaag 
actgagctcc 
etatctatge 
tegtgaegge 
ctaaacaatt 
tataatggaa 
taggatgtta 
tatactaccc 
ccatgatatt 
tactctgtat 
cagata'aaaa 
aaataataaa 
aaaaaagtaa 
ttatttaaac 
actatatgaa 
ggtgtgtccc 
aaaacaggca 
taaaacacta 
ttttatttta 
gtttcaaccc 
cc 



agctcatgga 
atctcacttc 
accttattgt 
ttatggaatg 
ctaactttag 
gaagtttgtc 
aggagacata 
atttatatat 
tctaatattt 
aaaggttgga 
aaaaattatg 
taacatataa 
atattgtcat 
gagagtaaac 
attttttttt 
aatccttata 
agggaaattt 
cacataaccc 
tttttttatc 
ttatacaaaa 



gagcatggaa 
ttctatgaat 
tctatgataa 
cttcaaatag 
cattgtgaac 
tccatttata 
acaattataa 
tatacttatc 
tagttgatat 
tcatccttaa 
agttggtttg 
tatatgtata 
aaatctatac 
atatttgact 
tttateggea 
caaccaactt 
tttaatttgg 
ttttagcagt 
agcaaagaat 
ccccaaaaac 



tattgtatcc 
aaacaaagga 
atttcctctt 
tacaaaaaca 
gagacataag 
tattatatat 
agagagaagt 
cacttattta 
gtatatgaaa 
agtgggtcta 
ataaaatatt 
taaatttatt 
aatcgtttag 
ttttggttat 
aggaaataaa 
ccacaggaag 
gttgtcttgt 
agagcaatgg 
aaataaaata 
aagtttccta 



gaccatgtaa 
tgttatgata 
attattataa 
aatgtgtact 
tgttaagaag 
tacccactta 
t tgt ate cat 
atgtctttat 
gggtactatt 
tttaatttta 
gaaggattta 
ataatataac 
ccttgctgga 
ttaacaaatt 
attaaattag 
gtcaggtegg 
ttgetgeata 
ttgaccgtgt 
aaatgagaca 
gcaccctacc 



cagtataata 
tattaacact 
atcatctgaa 
ataagacttt 
aeataacaat 
tgtattatat 
ttatatatta 
aaggtttgat 
tgaactctct 
ttgettctta 
aaataataat 
atttatctat 
cgactctcaa 
attatttaae 
gagggacaat 
ggacaacaaa 
atttatgeag 
gcttagcttc 
cttcagggat 
aactaaggta 



3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4922 



<210> 31 
<211> 118 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<223> Chimeric 
<400> 31 



Met 


Ala 




x iij. nia i^x. y 


rtlv Thr His His Asn Tie* TIp 


Gly Arg Asp 














JLD 


Gin 


Tvr 
xyir 


Pro 


ncL i ic l. \j j- y 


Arcr A^tj Arcr Asr» C5"ln Tvr* fil n 


Met 


Ser Gly 












30 




Arg 


Gly 


Ser 


Asp Tyr Ser 


Lys Ser Arg Gin He Ala Lys 


Ala 


Ala Thr 






35 




40 45 






Ala 


Val 


Thr 


Ala Gly Gly 


Ser Leu Leu Val Leu Ser Ser 


Leu 


Thr Leu 




50 




55 60 






Val 


Gly 


Thr 


Val He Ala 


Leu Thr Val Ala Thr Pro Leu 


Leu 


Val He 


65 






70 


75 




80 


Phe 


Ser 


Pro 


He Leu Val 


Pro Ala Leu He Thr Val Ala 


Leu 


Leu He 








85 


90 




95 


Thr 


Gly 


Phe 


Leu Ser Ser 


Gly Gly Phe Gly He Ala Ala 


He 


Thr Val 








100 


105 


110 




Phe 


Ser 


Trp 


He Tyr Lys 












115 











<210> 32 
<211> 513 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric 



<400> 32 



Tyr 


Ala 


Thr 


Gly 


Glu 


His 


Pro 


Gin Gly Ser Asp Lys Leu Asp Ser Ala 


1 








5 






10 15 


Arg 


Met 


Lys 


Leu 


Gly 


Ser 


Lys 


Ala Gin Asp Leu Lys Asp Arg Ala Gin 








20 








25 30 


Tyr 


Tyr 


Gly 


Gin 


Gin 


His 


Thr 


Gly Gly Glu His Asp Arg Asp Arg Thr 






35 










40 45 


Arg 


Gly 


Gly 


Gin 


His 


Thr 


Thr 


Met Asn Thr Thr Pro Ser Ala His Glu 




50 










55 


60 


Thr 


He 


His 


Glu 


Val 


He 


Val 


He Gly Ser Gly Pro Ala Gly Tyr Thr 


65 










70 




75 80 


Ala 


Ala 


Leu 


Tyr 


Ala 


Ala 


Arg 


Ala Gin Leu Thr Pro Leu Val Phe Glu 








85 




90 95 


Gly 


Thr 


Ser 


Phe 


Gly 


Gly 


Ala 


Leu Met Thr Thr Thr Glu Val Glu Asn 








100 








105 110 


Tyr 


Pro 


Gly 


Phe 


Arg 


Asn 


Gly 


He Thr Gly Pro Glu Leu Met Asp Asp 






115 










120 125 


Met 


Arg 


Glu 


Gin 


Ala 


Leu 


Arg 


Phe Gly Ala Glu Leu Arg Thr Glu Asp 




130 










135 


140 


Val 


Glu 


Ser 


Val 


Ser 


Leu 


Arg 


Gly Pro He Lys Ser Val Val Thr Ala 


145 










150 




155 160 


Glu 


Gly 


Gin 


Thr 


Tyr 


Gin 


Ala 


Arg Ala Val He Leu Ala Met Gly Thr 










165 






170 175 


Ser 


Val 


Arg 


Tyr 


Leu 


Gin 


He 


Pro Gly Glu Gin Glu Leu Leu Gly Arg 








180 








185 190 


Gly 


Val 


Ser 


Ala 


Cys 


Ala 


Thr 


Cys Asp Gly Ser Phe Phe Arg Gly Gin 






195 










200 205 


Asp 


He 


Ala 


Val 


He 


Gly 


Gly 


Gly Asp Ser Ala Met Glu Glu Ala Leu 




210 










215 


220 


Phe 


Leu 


Thr 


Arg 


Phe 


Ala 


Arg 


Ser Val Thr Leu Val His Arg Arg Asp 


225 










230 




235 240 


Glu 


Phe 


Arg 


Ala 


Ser 


Lys 


He 


Met Leu Gly Arg Ala Arg Asn Asn Asp 










245 






250 255 


Lys 


He 


Lys 


Phe 


lie 


Thr 


Asn 


His Thr Val Val Ala Val Asn Gly Tyr 



260 265 270 
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Thr 


Thr 


Val 


Thr 


Gly 


Leu 


Arg 


Leu Arg Asn 


Thr 


Thr 


Thr 


Gly 


Glu 


Glu 






275 










280 








285 








Thr 


Thr 

2Q0 
<c y \j 


Leu 


Val 


Val 


Thr 


Gly 
295 


Val Phe 


Val 


Ala 


He 
300 


Gly 


His 


Glu 


Pro 


Arg 


Ser 


Ser 


Leu 


Val 


Ser 


Asp 


Val Val Asp 


He 


Asp 


Pro 


Asp 


Gly 


Tvr 

JL 


~j \j ~j 










310 








115 

J JL ZJ 










320 


Val 


T.aii 


Val 

V GLJ. 


Lys 


Ol v 
\j jl y 


Arg 


Thr 


Thr Ser 


Thr 
330 


Ser 


Met 


Asp 


Glv 


Val 
335 


Phe 








21 an 


Leu 


Val 


Asp 


Arg Thr Tyr 


Arg 


Gin 


Ala 


He 


Thr 


Ala 








14.0 








345 










350 






Ala 




Ser 


Gly 


Cys 


Ala 


Ala 


Ala He Asp 


Ala 


Glu 




Tm 
AA r 


Leu 


Ala 
















360 








365 








Glu 


His 


Ala 


Gly 


Ser 


Lys 


Ala 


Asn Glu 


Thr 


Thr 


Glu 


Glu 


Thr 


Gly 


Asp 




370 






375 








380 










Val 


Asp 


Ser 


Thr 


Asp 


Thr 


Thr 


Asp Trp 


Ser 


Thr 


Ala 


Met 


Thr 


Asp 


Ala 


385 








390 








395 










400 


Lys 


Asn 


Ala 


Gly 


Val 


Thr 


He 


Glu Val 


Thr 


Asp 


Ala 


Ser 


Phe 


Phe 


Ala 






405 








410 








415 




A on 


V Ct JL 




4.9 0 


OCX 






Pro Val 
425 


Leu 


Val 




Phe 


Trn 
4,10 


Ala 


Thr 


Trp 


Cys 


Gly 


Pro 


Cys 


Lys 


Met 


Val Ala 


Pro 


Val 


Leu 


Glu 


Glu 


He 


Ala 




ATE 










440 








445 








Ser 


Glu 


Gin 


Arg 


Asn 


Gin 


Leu 

d 55 


Thr Val 


Ala 


Lys 


Leu 


Asp 


Val 


Asp 


Thr 


Asn 


Pro 


Glu 


Met 


Ala 


Arg 


Glu 


Phe Gin 


Val 


Val 


Ser 


He 


Pro 


Thr 


Met 


465 










470 








475 










480 


lie 


Leu 


Phe 


Gin 


Gly 
485 


Gly 


Gin 


Pro Val 


Lys 
490 


Arg 


He 


Val 


Gly 


Ala 
495 


Lys 


Gly 


Lys 


Ala 


Ala 
500 


Leu 


Leu 


Arg 


Asp Leu 
505 


Ser 


Asp 


Val 


Val 


Pro 
510 


Asn 


Leu 



Asn 



<210> 33 
<211> 4935 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

<222> (1554) . . . (1906) 
<221> CDS 

<222> (2147) . . . (3701) 
<223> Chimeric 



<400> 33 

ctgcaggaat tcattgtact 
tggtttttta cctctattta 
ttacttgtta ctttaatttc 
tatccctaca aatttattat 
ccgtctatct ttaatgtagt 
agcgttggta gaaagcataa 
caatataaac aaattcttta 
ttatcaaata tttttcaacc 
aatttaactc cataattttt 
agccattttt ttctttgaat 
aaatttcacc aaacaatcat 
ataattccca tttgacacta 
cttctaaagt aattttaata 
tattacttct aaaaaattaa 
aattgttgaa tttgtgacta 
agtttaaagt aaatataagt 
tataacatt't atggtggact 
atgtaagtcc gtaactagaa 



cccagtatca ttatagtgaa 
aaggggtttt ccacctaaaa 
tcataatctt tggttgaaat 
ttgttaaaca ttttcaaacc 
ctaacatttt catattgaaa 
agatttattc ttattcttct 
ccttaagaag gatttcccat 
acgtaaatct cataataata 
ttattcgact gatcttaaag 
aaaaaaatcc aattatcatt 
ttgtggtatt tctgaagcaa 
cggaagtaac tgaagatctg 
atagttacta tattcaagat 
ttagatataa ttaaaatatt 
ttgatttatt attctactat 
aatgtagtag agtgttagag 
aattttcata tatttcttat 
ttacagtggg ttgccatggc 



agttttggct ctctcgccgg 60 
attctggtat cattctcact 120 
tatcacgctt ccgcacacga 180 
gcataaaatt ttatgaagtc 240 
tatataattt acttaatttt 300 
tcatataaat gtttaatata 360 
tttatatttt aaaaatatat 420 
agttgtttca aaagtaataa 480 
caacacccag tgacacaact 540 
gtattttttt tatacaatga 600 
gtcatgttat gcaaaattct 660 
cttttacatg cgagacacat 720 
ttcatatatc aaatactcaa 780 
acttttttaa ttttaagttt 840 
gtttaaattg ttttatagat 900 
tgttacccta aaccataaac 960 
tgcttttacc ttttcttggt 1020 
actctgtggt cttttggttc 1080 
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atgcatgggt cttgcgcaag aaaaagacaa agaacaaaga aaaaagacaa aacagagaga 1140 
caaaacgcaa tcacacaacc aactcaaatt agtcactggc tgatcaagat cgccgcgtcc 1200 
atgtatgtct aaatgccatg caaagcaaca cgtgcttaac atgcacttta aatggctcac 1260 
ccatctcaac ccacacacaa acacattgcc tttttcttca tcatcaccac aaccacctgt 1320 
atatattcat tctcttccgc cacctcaatt tcttcacttc aacacacgtc aacctgcata 1380 
tgcgtgtcat cccatgccca aatctccatg catgttccaa ccaccttctc tcttatataa 1440 
tacctataaa tacctctaat atcactcact tctttcatca tccatccatc cagagtacta 1500 
ctactctact actataatac cccaacccaa ctcatattca atactactct act atg 1556 

Met 
1 

gcg gat aca get aga gga acc cat cac gat ate ate ggc aga gac cag 1604 
Ala Asp Thr Ala Arg Gly Thr His His Asp He He Gly Arg Asp Gin 
5 10 15 

tac ccg atg atg ggc cga gac cga gac cag tac cag atg tec gga cga 1652 
Tyr Pro Met Met Gly Arg Asp Arg Asp Gin Tyr Gin Met Ser Gly Arg 
20 25 30 

gga tct gac tac tec aag tct agg cag att get aaa get gca act get 1700 
Gly Ser Asp Tyr Ser Lys Ser Arg Gin He Ala Lys Ala Ala Thr Ala 
35 40 45 

gtc aca get ggt ggt tec etc ctt gtt etc tec age ctt acc ctt gtt 1748 
Val Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Ser Leu Thr Leu Val 
50 55 60 65 

gga act gtc ata get ttg act gtt gca aca cct ctg etc gtt ate ttc 1796 
Gly Thr Val He Ala Leu Thr Val Ala Thr Pro Leu Leu Val He Phe 
70 75 80 

age cca ate ctt gtc ccg get etc ate aca gtt gca etc etc ate acc 1844 
Ser Pro He Leu Val Pro Ala Leu He Thr Val Ala Leu Leu He Thr 
85 90 95 

ggt ttt ctt tec tct gga ggg ttt ggc att gee get ata acc gtt ttc 1892 
Gly Phe Leu Ser Ser Gly Gly Phe Gly He Ala Ala He Thr Val Phe 
100 105 110 

tct tgg att tac aa gtaagcacac atttatcatc ttacttcata attttgtgca 1946 
Ser Trp He Tyr Lys 
115 

atatgtgcat gcatgtgttg agecagtage tttggatcaa tttttttggt cgaataacaa 2006 
atgtaacaat aagaaattgc aaattctagg gaacatttgg ttaactaaat acgaaatttg 2066 
acctagctag cttgaatgtg tctgtgtata tcatctatat aggtaaaatg cttggtatga 2126 
tacctattga ttgtgaatag g tac gca acg gga gag cac cca cag gga tea 2177 

Tyr Ala Thr Gly Glu His Pro Gin Gly Ser 
120 125 

gac aag ttg gac agt gca agg atg aag ttg gga age aaa get cag gat 2225 
Asp Lys Leu Asp Ser Ala Arg Met Lys Leu Gly Ser Lys Ala Gin Asp 
130 135 140 

ctg aaa gac aga get cag tac tac gga cag caa cat act ggt ggg gaa 2273 
Leu Lys Asp Arg Ala Gin Tyr Tyr Gly Gin Gin His Thr Gly, Gly Glu 
145 150 155 160 

cat gac cgt gac cgt act cgt ggt ggc cag cac act acc atg aat ggt 2321 
His Asp Arg Asp Arg Thr Arg Gly Gly Gin His Thr Thr Met Asn Gly 
165 170 175 

etc gaa act cac aac aca agg etc tgt ate gta gga agt ggc cca gcg 2369 
Leu Glu Thr His Asn Thr Arg Leu Cys He Val Gly Ser Gly Pro Ala 
180 ~ 185 190 

gca cac acg gcg gcg att tac gca get agg get gaa ctt aaa cct ctt 2417 
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Ala His Thr Ala Ala lie Tyr Ala Ala Arg Ala Glu Leu Lys Pro Leu 
195 200 ^ 205 

etc ttc gaa gga tgg atg get aac gac ate get ccc ggt ggt caa eta 2465 
Leu Phe Glu Gly Trp Met Ala Asn Asp He Ala Pro Gly Gly Gin Leu 
210 215 220 

aca acc acc acc gac gtc gag aat ttc ccc gga ttt cca gaa ggt att 2513 
Thr Thr Thr Thr Asp Val Glu Asn Phe Pro Gly Phe Pro Glu Gly lie 
225 230 235 240 

etc gga gta gag etc act gac aaa ttc cgt aaa caa teg gag cga ttc 2561 
Leu Gly Val Glu Leu Thr Asp Lys Phe Arg Lys Gin Ser Glu Arg Phe 
245 250 255 

ggt act acg ata ttt aca gag acg gtg acg aaa gtc gat ttc tct teg 2609 
Gly Thr Thr He Phe Thr Glu Thr Val Thr Lys Val Asp Phe Ser Ser 
260 265 ' 270 

aaa ccg ttt aag eta ttc aca gat tea aaa gec att etc get gac get 2657 
Lys 1 Pro Phe Lys Leu Phe Thr Asp Ser Lys Ala He Leu Ala Asp Ala 
275 280 285 

gtg att etc get act gga get gtg get aag egg ctt age ttc gtt gga 2705 
Val He Leu Ala Thr Gly Ala Val Ala Lys Arg Leu Ser Phe Val Gly 
290 295 300 

tct ggt gaa ggt tct gga ggt ttc tgg aac cgt gga ate tec get tgt 2753 
Ser Gly Glu Gly Ser Gly Gly Phe Trp Asn Arg Gly He Ser Ala Cys 
305 310 " 315 320 

get gtt tgc gac gga get get ccg ata ttc cgt aac aaa cct ctt gcg 2801 
Ala Val Cys Asp Gly Ala Ala Pro He Phe Arg Asn Lys Pro Leu Ala 
325 330 335 

gtg ate ggt gga ggc gat tea gca atg gaa gaa gca aac ttt ctt aca 2849 
Val He Gly Gly Gly Asp Ser Ala Met Glu Glu Ala Asn Phe Leu Thr 
340 345 350 

aaa tat gga tct aaa gtg tat ata ate cat agg aga gat get ttt aga 2897 
Lys Tyr Gly Ser Lys Val Tyr lie He His Arg Arg Asp Ala Phe Arg 
355 360 365 

gcg tct aag att atg cag cag cga get ttg tct aat cct aag att gat 2945 
Ala Ser Lys He Met Gin Gin Arg Ala Leu Ser Asn Pro Lys He Asp 
370 375 380 

gtg att tgg aac teg tct gtt gtg gaa get tat gga gat gga gaa aga 2993 
Val He Trp Asn Ser Ser Val Val Glu Ala Tyr Gly Asp Gly Glu Arg 
385 390 395 400 

gat gtg ctt gga gga ttg aaa gtg aag aat gtg gtt acc gga gat gtt 3041 
Asp Val Leu Gly Gly Leu Lys Val Lys Asn Val Val Thr Gly Asp Val 
405 ~ 410 " 415 

tct gat tta aaa gtt tct gga ttg ttc ttt get att ggt cat, gag cca 3089 
Ser Asp Leu Lys Val Ser Gly Leu Phe Phe Ala He Gly His Glu Pro 
420 425 430 

get acc aag ttt ttg gat ggt ggt gtt gag tta gat teg gat ggt tat 3137 
Ala Thr Lys Phe Leu Asp Gly Gly Val Glu Leu Asp Ser Asp Gly Tyr 
435 440 ' 445 

gtt gtc acg aag cct ggt act aca cag act age gtt ccc gga gtt ttc 3185 
Val Val Thr Lys Pro Gly Thr Thr Gin Thr Ser Val Pro Gly Val Phe 
450 455 460 
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get gcg ggt gat gtt cag gat aag aag tat agg caa gec ate act get 
Ala Ala Gly Asp Val Gin Asp Lys Lys Tyr Arg Gin Ala lie Thr Ala 
465 470 475 480 



3233 



gca gga act ggg tgc atg gca get ttg gat gca gag cat tac tta caa 
Ala Gly Thr Gly Cys Met Ala Ala Leu Asp Ala Glu His Tyr Leu Gin 
485 490 495 



3281 



gag att get gga teg aag get aac gag ace ace gag gaa act gga gat 
Glu lie Ala Gly Ser Lys Ala Asn Glu Thr Thr Glu Glu Thr Gly Asp 
500 ~ 505 510 



3329 



gtt gac teg acg gat act acg gat tgg teg acg get atg gaa gaa gga 
Val Asp Ser Thr Asp Thr Thr Asp Trp Ser Thr Ala Met Glu Glu Gly 
515 520 525 



3377 



caa gtg ate gee tgc cac acc gtt gag aca tgg aac gag cag ctt cag 
Gin Val He Ala Cys His Thr Val Glu Thr Trp Asn Glu Gin Leu Gin 
530 535 540 



3425 



aag get aat gaa tec aaa act ctt gtg gtg gtt gat ttc acg get tct 
Lys Ala Asn Glu Ser Lys Thr Leu Val Val Val Asp Phe Thr Ala Ser 
545 550 555 560 



3473 



tgg tgt gga cca tgt cgt ttc ate get cca ttc ttt get gat ttg get 
Trp Cys Gly Pro Cys Arg Phe He Ala Pro Phe Phe Ala Asp Leu Ala 
565 570 575 



3521 



aag aaa ctt cct aac gtg ctt ttc etc aag. gtt gat act gat gaa ttg 
Lys Lys Leu Pro Asn Val Leu Phe Leu Lys Val Asp Thr Asp Glu Leu 
580 585 " 590 



3569 



aag teg gtg gca agt gat tgg gcg ata cag gcg atg cca acc ttc atg 
Lys Ser Val Ala Ser Asp Trp Ala He Gin Ala Met Pro Thr Phe Met 
595 600 605 



3617 



ttt ttg aag gaa ggg aag att ttg gac aaa gtt gtt gga gee aag aaa 
Phe Leu Lys Glu Gly Lys He Leu Asp Lys Val Val Gly Ala Lys Lys 
610 615 620 



3665 



gat gag ctt cag tct acc att gec aaa cac ttg get taagcttaaa 
Asp Glu Leu Gin Ser Thr lie Ala Lys His Leu Ala 
625 630 635 



3711 



taagtatgaa 
ccgaccatgt 
gatgttatga 
ttattattat 
caaatgtgta 
agtgttaaga 
attacccact 
gtttgtatcc 
taatgtcttt 
aagggtacta 
tatttaattt 
ttgaaggatt 
ttataatata 
agecttgetg 
atttaacaaa 
aaattaaatt 
aggtcaggtc 
gtttgetgea 
ggttgaccgt 
taaaatgaga 
tagcacccta 



etaaaatgea 
aaeagtataa 
tatattaaca 
aaatcatctg 
ctataagact 
agacataaca 
tatgtattat 
atttatatat 
ataaggtttg 
tttgaactct 
tattgettet 
taaaataata 
acatttatct 
gacgactctc 
ttattattta 
aggagggaca 

ggggacaaca 

taatttatgc 
gtgettaget 
cacttcaggg 
ccaactaagg 



tgtaggtgta 
taactgagct 
ctctatctat 
aatcgtgacg 
ttctaaacaa 
attataatgg 
attaggatgt 
tatatactac 
atccatgata 
cttactctgt 
tacagataaa 
ataaataata 
ataaaaaagt 
aattatttaa 
acactatatg 
atggtgtgtc 
aaaaaacagg 
agtaaaacac 
tcttttattt 
atgtttcaac 
tacc 



agagctcatg 
ccatctcact 
gcaccttatt 
gcttatggaa 
ttctaacttt 
aagaagtttg 
taaggagaca 
ccatttatat 
tttctaatat 
ataaaggttg 
aaaaaaatta 
aataacatat 
aaatattgtc 
acgagagtaa 
aaattttttt 
ccaatcctta 
caagggaaat 
tacacataac 
tattttttta 
ccttatacaa 



gagagcatgg 
tcttctatga 
gttctatgat 
tgcttcaaat 
agcattgtga 
tctccattta 
taacaattat 
attatactta 
tttagttgat 
gatcatcctt 
tgagttggtt 
aatatatgta 
ataaatctat 
acatatttga 
tttttategg 
tacaaccaac 
tttttaattt 
ccttttagca 
tcagcaaaga 
aaccccaaaa 



aatattgtat 
ataaacaaag 
aaatttcctc 
agtacaaaaa 
acgagacata 
tatattatat 
aaagagagaa 
tccacttatt 
atgtatatga 
aaagtgggtc 
tgataaaata 
,tataaattta 
acaatcgttt 
ctttttggtt 
caaggaaata 
ttccacagga 
gggttgtctt 
gtagagcaat 
ataaataaaa 
acaagtttcc 



3771 
3831 
3891 
3951 
4011 
4071 
4131 
4191 
4251 
4311 
4371 
4431 
4491 
4551 
4611 
4671 
4731 
4791 
4851 
4911 
£935 



<210> 34 
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<211> 118 

<212> PRT 

<213> Artificial Sequence 
<220> 

<221> SITE 

<222> (1) . . . (118) 

<223> oleosin 

<223> Chimeric 



<400> 34 



Met 


Ala 


Asp 


Thr 


Ala 


Arg 


Gly Thr His 


1 








5 






Gin 


Tyr 


Pro 


Met 


Met 


Gly 


Arg Asp Arg 








20 






25 


Arg 


Gly 


Ser 


Asp 


Tyr 


Ser 


Lys Ser Arg 






35 








40 


Ala 


Val 


Thr 


Ala 


Gly 


Gly 


Ser Leu Leu 




50 






55 


Val 


Gly 


Thr 


Val 


He 


Ala 


Leu Thr Val 


65 










70 




Phe 


Ser 


Pro 


He 


Leu 


Val 


Pro Ala Leu 


Thr 


Gly 


Phe 


Leu 


85 
Ser 


Ser 


Gly Gly Phe 








100 






105 


Phe 


Ser 


Trp 


He 


Tyr 


Lys 








115 











<210> 35 
<211> 518 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> SITE 
<222> (1) . . . (55) 
<223> oleosin 

<221> SITE 

<222> (56) . . . (383) 

<223> thioredoxin reductase 

<221> SITE 

<222> (384) . . . (406) 

<223> linker 

<221> SITE 

<222> (407) . . . (518) 

<223> thioredoxin 

<223> Chimeric 



<400> 35 



Tyr Ala 


Thr 


Gly 


Glu 
5 


His 


Pro Gin Gly 


1 

Arg Met 


Lys 


Leu 


Gly 


Ser 


Lys Ala Gin 




20 






25 


Tyr Tyr 


Gly 


Gin 


Gin 


His 


Thr Gly Gly 




35 








40 


Arg Gly 


Gly 


Gin 


His 


Thr 


Thr Met Asn 


50 










55 


Arg Leu 


Cys 


He 


Val 


Gly 


Ser Gly Pro 


65 








70 




Tyr Ala 


Ala 


Arg 


Ala 


Glu 


Leu Lys Pro 








85 





PCT/US01/50240 



His Asp He He Gly Arg Asp 
10 15 
Asp Gin Tyr Gin Met Ser Gly 
30 

Gin He Ala Lys Ala Ala Thr 
45 

Val Leu Ser Ser Leu Thr Leu 
60 

Ala Thr Pro Leu Leu Val lie 

75 80 
He Thr Val Ala Leu Leu He 
90 95 
Gly He Ala Ala He Thr Val 
110 



Ser Asp Lys Leu Asp Ser Ala 
10 15 
Asp Leu Lys Asp Arg Ala Gin 
30 

Glu His Asp Arg Asp Arg Thr 
45 

Gly Leu Glu Thr His Asn Thr 
60 

Ala Ala His Thr Ala Ala He 

75 80 
Leu Leu Phe Glu Gly Trp Met 
90 95 
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Ala Asn Asp lie Ala Pro Gly Gly Gin Leu Thr Thr Thr Thr Asp Val 

100 105 110 

Glu Asn Phe Pro Gly Phe Pro Glu Gly He Leu Gly Val Glu Leu Thr 

115 120 125 

Asp Lys Phe Arg Lys Gin Ser Glu Arg Phe Gly Thr Thr He Phe Thr 

130 "* 135 " 140 

Glu Thr Val Thr Lys Val Asp Phe Ser Ser Lys Pro Phe Lys Leu Phe 
145 " 150 155 ~ 160 

Thr Asp Ser Lys Ala He Leu Ala Asp Ala Val He Leu Ala Thr Gly 

165 170 175 

Ala Val Ala Lys Arg Leu Ser Phe Val Gly Ser Gly Glu Gly Ser Gly 

180 185 190 

Gly Phe Trp Asn Arg Gly He Ser Ala Cys Ala Val Cys Asp Gly Ala 

195 200 205 

Ala Pro He Phe Arg Asn Lys Pro Leu Ala Val He Gly Gly Gly Asp 

210 215 220 

Ser Ala Met Glu Glu Ala Asn Phe Leu Thr Lys Tyr Gly Ser Lys Val 
225 230 235 ~ 240 

Tyr He He His Arg Arg Asp Ala Phe Arg Ala Ser Lys He Met Gin 

245 250 255 

Gin Arg Ala Leu Ser Asn Pro Lys He Asp Val He Trp Asn Ser Ser 

260 265 270 

Val Val Glu Ala Tyr Gly Asp Gly Glu Arg Asp Val Leu Gly Gly Leu 

275 280 * 285 

Lys Val Lys Asn Val Val Thr Gly Asp Val Ser Asp Leu Lys Val Ser 

290 295 300 

Gly Leu Phe Phe Ala He Gly His Glu Pro Ala Thr Lys Phe Leu Asp 
305 310 315 320 

Gly Gly Val Glu Leu Asp Ser Asp Gly Tyr Val Val Thr Lys Pro Gly 

325 330 335 

Thr Thr Gin Thr Ser Val Pro Gly Val Phe Ala Ala Gly Asp Val Gin 

340 345 350 

Asp Lys Lys Tyr Arg Gin Ala He Thr Ala Ala Gly Thr Gly Cys Met 

355 360 365 

Ala Ala Leu Asp Ala Glu His Tyr Leu Gin Glu He Ala Gly Ser Lys 

370 375 380 

Ala Asn Glu Thr Thr Glu Glu Thr Gly Asp Val Asp Ser Thr Asp Thr 
385 390 . 395 400 

Thr Asp Trp Ser Thr Ala Met Glu Glu Gly Gin Val He Ala Cys His 

405 410 415 

Thr Val Glu Thr Trp Asn Glu Gin Leu Gin Lys Ala Asn Glu Ser Lys 

420 425 430 

Thr Leu Val Val Val Asp Phe Thr Ala Ser Trp Cys Gly Pro Cys Arg 

435 440 ^ 445 

Phe He Ala Pro Phe Phe Ala Asp Leu Ala Lys Lys Leu Pro Asn Val 

450 455 460 

Leu Phe Leu Lys Val Asp Thr Asp Glu Leu Lys Ser Val Ala Ser Asp 
465 470 475 480 

Trp Ala He Gin Ala Met Pro Thr Phe Met Phe Leu Lys Glu Gly Lys 

485 490 495 

He Leu Asp Lys Val Val Gly Ala Lys Lys Asp Glu Leu Gin Ser Thr 

500 505 510 

He Ala Lys His Leu Ala 
515 



<210> 36 
<211> 458 
<212> PRT 

<213> Mycobacterium leprae 
<400> 36 

Met Asn Thr Thr Pro Ser Ala His Glu Thr He His Glu Val He Val 

1.5 10 15 

He Gly Ser Gly Pro Ala Gly Tyr Thr Ala Ala Leu Tyr Ala Ala Arg 

20 25 30 

Ala Gin Leu Thr Pro Leu Val Phe Glu Gly Thr Ser Phe Gly Gly Ala 
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35 


40 


45 






Leu 


Met 


Thr 


Thr Thr Glu Val Glu 


Asn Tyr Pro Gly Phe Arg 


Asn 


Gly 




50 




55 


60 






He 


Thr 


Gly 


Pro Glu Leu Met Asp 


Asp Met Arg Glu Gin Ala 


Leu 


Arg 


65 




70 


75 




80 


Phe 


Gly 


Ala 


Glu Leu Arg Thr Glu 


Asp Val Glu Ser Val Ser 


Leu 


Arg 






85 


90 


95 




Gly 


Pro 


He 


Lys Ser Val Val Thr 


Ala Glu Gly Gin Thr Tyr 


Gin 


Ala 








100 


105 110 






Arg 


Ala 


Val 


He Leu Ala Met Gly 


Thr Ser Val Arg Tyr Leu 


Gin 


He 




115 


120 


125 






Pro 


Gly 


Glu 


Gin Glu Leu Leu Gly 


Arg Gly Val Ser Ala Cys 


Ala 


Thr 




130 




135 


140 




Cys 


Asp 


Gly 


Ser Phe Phe Arg Gly 


Gin Asp He Ala Val He 


Gly 


Gly 


145 






150 


155 




160 


Gly 


Asp 


Ser 


Ala Met Glu Glu Ala 


Leu Phe Leu Thr Arg Phe 


Ala 


Arg 








165 


170 


175 




Ser 


Val 


Thr 


Leu Val His Arg Arg 


Asp Glu Phe Arg Ala Ser 


Lys 


He 








180 


185 190 






Met 


Leu 


Gly 


Arg Ala Arg Asn Asn 


Asp Lys He Lys Phe He 


Thr 


Asn 






195 


200 


205 






His 


Thr 


Val 


Val Ala Val Asn Gly 


Tyr Thr Thr Val Thr Gly 


Leu 


Arg 




210 




215 


220 






Leu 


Arg 


Asn 


Thr Thr Thr Gly Glu 


Glu Thr Thr Leu Val Val 


Thr 


Gly 


225 




230 


235 




240 


Val 


Phe 


Val 


Ala lie Gly His Glu 


Pro Arg Ser Ser Leu Val 


Ser 


Asp 








245 


250 


255 




Val 


Val 


Asp 


He Asp Pro Asp Gly 


Tyr Val Leu Val Lys Gly 


Arg 


Thr 








260 


265 " 270 






Thr 


Ser 


Thr 


Ser Met Asp Gly Val 


Phe Ala Ala Gly Asp Leu 


Val 


Asp 






275 


280 


285 






Arg 


Thr 


Tyr 


Arg Gin Ala He Thr 


Ala Ala Gly Ser Gly Cys 


Ala 


Ala 




290 




295 


300 






Ala 


He 


Asp 


Ala Glu Arg Trp Leu 


Ala Glu His Ala Gly Ser 


Lys 


Ala 


305 




Thr 


310 


315 




320 


Asn 


Glu 


Thr Glu Glu Thr Gly 


Asp Val Asp Ser Thr Asp 


Thr 


Thr 








325 


330 


335 




Asp 


Trp 


Ser 


Thr Ala Met Thr Asp 


Ala Lys Asn Ala Gly Val 


Thr 


He 








340 


345 350 






Glu 


Val 


Thr 


Asp Ala Ser Phe Phe 


Ala Asp Val Leu Ser Ser 


Asn 


Lys 






355 


360 


365 






Pro 


Val 


Leu 


Val Asp Phe Trp Ala 


Thr Trp Cys Gly Pro Cys 


Lys 


Met 




370 




375 


380 






Val 


Ala 


Pro 


Val Leu Glu Glu He 


Ala Ser Glu Gin Arg Asn 


Gin 


Leu 


385 






390 


395 




400 


Thr 


Val 


Ala 


Lys Leu Asp Val Asp 


Thr Asn Pro Glu Met Ala 


Arg 


Glu 








405 


410 


415 




Phe 


Gin 


Val 


Val Ser He Pro Thr 


Met He Leu Phe Gin Gly 


Gly 


Gin 








420 


425 430 






Pro 


Val 


Lys 


Arg He Val Gly Ala 


Lys Gly Lys Ala Ala Leu 


Leu 


Arg 






435 


440 


445 






Asp 


Leu 


Ser 


Asp Val Val Pro Asn 


Leu Asn 








450 




455 









<210> 37 
<211> 471 
<212> PRT 

<213> Arabidopsis thaliana 
<220> 

<223> Chimeric 
<400> 37 

Met Asn Gly Leu Glu Thr His Asn Thr Arg Leu Cys He Val Gly Ser 

1 ~ 5 10 15 

Gly Pro Ala Ala His Thr Ala Ala He Tyr Ala Ala Arg Ala Glu Leu 
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Lys Pro 

Gly Gin 

50 
Glu Gly 
65 

Glu Arg 

Phe Ser 

Ala Asp 

Phe Val 
130 
Ser Ala 
145 

Pro Leu 

Phe Leu 

Ala Phe 

Lys He 
210 
Gly Glu 
225 

Gly Asp 

His Glu 

Asp Gly 

Gly Val 
290 
He Thr 
305 

Tyr Leu 

Thr Gly 

Glu Glu 

His Thr 
370 
Lys Thr 
385 

Arg Phe 

Val Leu 

Asp Trp 

Lys He 
450 
Thr He 
465 



20 

Leu Leu Phe 
35 

Leu Thr Thr 

He Leu Gly 

Phe Gly Thr 
85 

Ser Lys Pro 

100 
Ala Val He 
115 

Gly Ser Gly 

Cys Ala Val 

Ala Val He 
165 

Thr Lys Tyr 

180 
Arg Ala Ser 
195 

Asp Val He 

Arg Asp Val 

Val Ser Asp 
245 

Pro Ala Thr 

260 
Tyr Val Val 
275 

Phe Ala Ala 

Ala Ala Gly 

Gin Glu He 
325 

Asp Val Asp 

340 
Gly Gin Val 
355 

Val Glu Thr 

Leu Val Val 

He Ala Pro 
405 

Phe Leu Lys 

420 
Ala He Gin 
435 

Leu Asp Lys 
Ala Lys His 



Glu Gly 

Thr Thr 

55 
Val Glu 
70 

Thr He 

Phe Lys 

Leu Ala 

Glu Gly 
135 
Cys Asp 
150 

Gly Gly 

Gly Ser 

Lys He 

Trp Asn 
215 
Leu Gly 
230 

Leu Lys 

Lys Phe 

Thr Lys 

Gly Asp 
295 
Thr Gly 
310 

Ala Gly 

Ser Thr 

He Ala 

Trp Asn 
375 
Val Asp 
390 

Phe Phe 

Val Asp 

Ala Met 

Val Val 
455 
Leu Ala 
470 



25 
Trp Met 
40 

Asp Val 

Leu Thr 

Phe Thr 

Leu Phe 
105 
Thr Gly 
120 

Ser Gly 

Gly Ala 

Gly Asp 

Lys Val 
185 
Met Gin 
200 

Ser Ser 

Gly Leu 

Val Ser 

Leu Asp 
265 
Pro Gly 
280 

Val Gin 

Cys Met 

Ser Lys 

Asp Thr 
345 
Cys Glu 
360 

Glu Gin 

Phe Thr 

Ala Asp 

Thr Asp 
425 
Pro Thr 
440 

Gly Ala 



Ala Asn Asp 

Glu Asn Phe 
60 

Asp Lys Phe 
75 

Glu Thr Val 
90 

Thr Asp Ser 

Ala Val Ala 

Gly Phe Trp 
140 

Ala Pro He 

155 
Ser Ala Met 
170 

Tyr He He 

Gin Arg Ala 

Val Val Glu 
220 

Lys Val Lys 

235 
Gly Leu Phe 
250 

Gly Gly Val 

Thr Thr Gin 

Asp Lys Lys 
300 

Ala Ala Leu 

315 
Ala Asn Glu 
330 

Thr Asp Trp 

Glu Gly Gin 

Leu Gin Lys 
380 

Ala Ser Trp 

395 
Leu Ala Lys 
410 

Glu Leu Lys 

Phe Met Phe 

Lys Lys Asp 
460 



30 
He Ala 
45 

Pro Gly 

Arg Lys 

Thr Lys 

Lys Ala 
110 
Lys Arg 
125 

Asn Arg 

Phe Arg 

Glu Glu 

His Arg 
190 
Leu Ser 
205 

Ala Tyr 

Asn Val 

Phe Ala 

Glu Leu 
270 
Thr Ser 
285 

Tyr Arg 

Asp Ala 

Thr Thr 

Ser Thr 
350 
Val He 
365 

Ala Asn 

Cys Gly 

Lys Leu 

Ser Val 
430 
Leu Lys 
445 

Glu Leu 



Pro Gly 

Phe Pro 

Gin Ser 
80 

Val Asp 
95 

lie Leu 

Leu Ser 

Gly He 

Asn Lys 
160 
Ala Asn 
175 

Arg Asp 

Asn Pro 

Gly Asp 

Val Thr 
240 
He Gly 
255 

Asp Ser 

Val Pro 

Gin Ala 

Glu His 
320 
Glu Glu 
335 

Ala Met 

Ala Cys 

Glu Ser 

Pro Cys 
400 
Pro Asn 
415 

Ala Ser 
Glu Gly 
Gin Ser 



<210> 38 

<211> 345 

<212> DNA 

<213> Arabidopsis thai i ana 

<220> 

<221> CDS 

<222> (1) . . . (345) 
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<400> 38 

atg get teg gaa gaa gga caa gtg ate gee tgc cac acc gtt gag aca 48 
Met Ala Ser Glu Glu Gly Gin Val He Ala Cys His Thr Val Glu Thr 
15 10 15 

tgg aac gag cag ctt cag aag get aat gaa tec aaa act ctt gtg gtg 96 
Trp Asn Glu Gin Leu Gin Lys Ala Asn Glu Ser Lys Thr Leu Val Val 
20 25 30 

gtt gat ttc acg get tct tgg tgt gga cca tgt cgt ttc ate get cca 144 
Val Asp Phe Thr Ala Ser Trp Cys Gly Pro Cys Arg Phe He Ala Pro 
35 40 45 

ttc ttt get gat ttg get aag aaa ctt cct aac gtg ctt ttc etc aag 192 
Phe Phe Ala Asp Leu Ala Lys Lys Leu Pro Asn Val Leu Phe Leu Lys 
50 55 60 

gtt gat act gat gaa ttg aag teg gtg gca agt gat tgg gcg ata cag 240 
Val Asp Thr Asp Glu Leu Lys Ser Val Ala Ser Asp Trp Ala He Gin 
65 70 75 80 

gcg atg cca acc ttc atg ttt ttg aag gaa ggg aag att ttg gac aaa 288 
Ala Met Pro Thr Phe Met Phe Leu Lys Glu Gly Lys He Leu Asp Lys 
85 90 A 95 

gtt gtt gga gec aag aaa gat gag ctt cag tct acc att gec aaa cac 336 
Val Val Gly Ala Lys Lys Asp Glu Leu Gin Ser Thr He Ala Lys His 
100 105 110 

ttg get taa 345 
Leu Ala * 



<210> 39 
<211> 114 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 39 

Met Ala Ser Glu Glu Gly Gin Val He Ala Cys His Thr Val Glu Thr 

15 10 15 

Trp Asn Glu Gin Leu Gin Lys Ala Asn Glu Ser Lys Thr Leu Val Val 

20 25 30 

Val Asp Phe Thr Ala Ser Trp Cys Gly Pro Cys Arg Phe He Ala Pro 

35 40 45 ' 

Phe Phe Ala Asp Leu Ala Lys Lys Leu Pro Asn Val Leu Phe Leu Lys 

50 55 60 

Val Asp Thr Asp Glu Leu Lys Ser Val Ala Ser Asp Trp Ala He Gin 
65 70 75 80 

Ala Met Pro Thr Phe Met Phe Leu Lys Glu Gly Lys He Leu Asp Lys 

85 90 95 

Val Val Gly Ala Lys Lys Asp Glu Leu Gin Ser Thr He Ala Lys His 
100 105 110 

Leu Ala 



<210> 40 

<211> 999 

<212> DNA 

<213> Arabidopsis thaliana 

<220> 

<221> CDS 

<222> (1) . . . (999) 
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<400> 40 

atg aat ggt etc gaa act cac aac aca agg etc tgt ate gta gga agt 48 

Met Asn Gly Leu Glu Thr His Asn Thr Arg Leu Cys lie Val Gly Ser 
15 10 15 

ggc cca gcg gca cac acg gcg gcg att tac gca get agg get gaa ctt 96 
Gly Pro Ala Ala His Thr Ala Ala lie Tyr Ala Ala Arg Ala Glu Leu 
20 25 30 

aaa cct ctt etc ttc gaa gga tgg atg get aac gac ate get ccc ggt 144 
Lys Pro Leu Leu Phe Glu Gly Trp Met Ala Asn Asp He Ala Pro Gly 
35 40 45 

ggt caa etc aac caa cca ccg cgt gag aat ttc ccc gga ttt cca gaa 192 
Gly Gin Leu Asn Gin Pro Pro Arg Glu Asn Phe Pro Gly Phe Pro Glu 
50 55 60 

ggt att etc gga gta gag etc act gac aaa ttc cgt aaa caa teg gag 240 
Gly He Leu Gly Val Glu Leu Thr Asp Lys Phe Arg Lys Gin Ser Glu 
65 70 " 75 80 

cga ttc ggt act acg ata ttt aca gag acg gtg acg aaa gtc gat ttc 288 
Arg Phe Gly Thr Thr He Phe Thr Glu Thr Val Thr Lys Val Asp Phe 
85 90 95 

tct teg aaa ccg ttt aag eta ttc aca gat tea aaa gec att etc get 336 
Ser Ser Lys Pro Phe Lys Leu Phe Thr Asp Ser Lys Ala He Leu Ala 
100 105 110 

gac get gtg att etc get ate gga get gtg get aag tgg ctt age ttc 384 
Asp Ala Val He Leu Ala He Gly Ala Val Ala Lys Trp Leu Ser Phe 
115 120 125 

gtt gga tct ggt gaa gtt etc gga ggt ttg tgg aac cgt gga ate tec 432 
Val Gly Ser Gly Glu Val Leu Gly Gly Leu Trp Asn Arg Gly He Ser 
130 135 " 140 

get tgt get gtt tgc gac gga get get ccg ata ttc cgc aac aaa cct 480 
Ala Cys Ala Val Cys Asp Gly Ala Ala Pro He Phe Arg Asn Lys Pro 
145 150 155 160 

ctt gcg gtg ate ggt gga ggc gat tct gca atg gaa gaa gca aac ttt 528 
Leu Ala Val He Gly Gly Gly Asp Ser Ala Met Glu Glu Ala Asn Phe 
165 170 175 

ctt aca aaa tat gga tct aaa gtg tat ata ate gat agg aga gat get 576 
Leu Thr Lys Tyr Gly Ser Lys Val Tyr He He Asp Arg Arg Asp Ala 
180 " 185 . 190 

ttt aga gcg tct aag att atg cag cag cga get ttg tct aat cct aag 624 
Phe Arg Ala Ser Lys He Met Gin Gin Arg Ala Leu Ser Asn Pro Lys 
195 200 205 

att gat gtg att tgg aac teg tct gtt gtg gaa get tat gga gat gga 672 
He Asp Val He Trp Asn Ser Ser Val Val Glu Ala Tyr Gly Asp Gly 
210 215 220 

gaa aga gat gtg ctt gga gga ttg aaa gtg aag aat gtg gtt ace gga 720 
Glu Arg Asp Val Leu Gly Gly Leu Lys Val Lys Asn Val Val Thr Gly 
225 230 ^ 235 240 



gat gtt tct gat tta aaa gtt tct gga ttg ttc ttt get att ggt cat 
Asp Val Ser Asp Leu Lys Val Ser Gly Leu Phe Phe Ala He Gly His 
245 250 255 

gag cca get acc aag ttt ttg gat ggt ggt gtt gag tta gat teg gat 
Glu Pro Ala Thr Lys Phe Leu Asp Gly Gly Val Glu Leu Asp Ser Asp 



768 



816 
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260 265 270 

ggt tat gtt gtc acg aag cct ggt act aca cag act age gtt ccc gga 864 
Gly Tyr Val Val Thr Lys Pro Gly Thr Thr Glri Thr Ser Val Pro Gly 
275 280 285 

gtt ttc get gcg ggt gat gtt cag gat aag aag tat agg caa gec ate 912 
Val Phe Ala Ala Gly Asp Val Gin Asp Lys Lys Tyr Arg Gin Ala lie 
290 295 300 

act get gca gga act ggg tgc atg gca get ttg gat gca gag cat tac 960 
Thr Ala Ala Gly Thr Gly Cys Met Ala Ala Leu Asp Ala Glu His Tyr 
305 310 315 320 

tta caa gag att gga tct cag caa ggt aag agt gat tga 999 
Leu Gin Glu lie Gly Ser Gin Gin Gly Lys Ser Asp * 
325 330 



<210> 41 

<211> 332 

<212> PRT 

<213> Arabidopsis thaliana 



<400> 41 



Met 


Asn 


Gly 


Leu 


Glu 


Thr 


His Asn 


1 






5 






Gly 


Pro 


Ala 


Ala 


Hxs 


Thr 


Ala Ala 






20 








Lys 


Pro 


Leu 
35 


Leu 


Phe 


Glu 


Gly Trp 
40 


Gly 


Gin 


Leu 


Asn 


Gin 


Pro 


Pro Arg 


50 










55 


Gly 


lie 


Leu 


Gly 


Val 


Glu 


Leu Thr 


65 










70 




Arg 


Phe 


Gly 


Thr 


Thr 


He 


Phe Thr 






85 






Ser 


Ser 


Lys 


Pro 


Phe 


Lys 


Leu Phe 






100 






Asp 


Ala 


Val 


He 


Leu 


Ala 


He Gly 




115 








120 


Val 


Gly 
130 


Ser 


Gly 


Glu 


Val 


Leu Gly 
135 


Ala 


Cys 


Ala 


Val 


Cys 


Asp 


Gly Ala 


145 










150 




Leu 


Ala 


Val 


He 


Gly 
165 


Gly 


Gly Asp 


Leu 


Thr 


Lys 


Tyr 
180 


Gly 


Ser 


Lys Val 


Phe 


Arg 


Ala 
195 


Ser 


Lys 


He 


Met Gin 
200 


lie 


Asp 
210 


Val 


He 


Trp 


Asn 


Ser Ser 
215 


Glu 


Arg 


Asp 


Val 


Leu 


Gly 


Gly Leu 


225 










230 




Asp 


Val 


Ser 


Asp 


Leu 
245 


Lys 


Val Ser 


Glu 


Pro 


Ala 


Thr 
260 


Lys 


Phe 


Leu Asp 


Gly 


Tyr 


Val 
275 


Val 


Thr 


Lys 


Pro ,Gly 
280 


Val 


Phe 
290 


Ala 


Ala 


Gly 


Asp 


Val Gin 
295 


Thr 


Ala 


Ala 


Gly 


Thr 


Gly 


Cys Met 


305 










310 




Leu 


Gin 


Glu 


He 


Gly 
325 


Ser 


Gin Gin 



Thr 


Arg 


Leu Cys He Val Gly 


Ser 




10 


15 




He 


Tyr 


Ala Ala Arg Ala Glu 


Leu 


25 




30 




Met 


Ala 


Asn Asp He Ala Pro 


Gly 






45 




Glu 


Asn 


Phe Pro Gly Phe Pro 


Glu 






60 




Asp 


Lys 


Phe Arg Lys Gin Ser 


Glu 






75 


O ft 

80 


Glu 


Thr 


Val Thr Lys Val Asp 


Phe 




90 


95 




Thr 


Asp 


Ser Lys Ala He Leu 


Ala 


105 




110 




Ala 


Val 


Ala Lys Trp Leu Ser 


Phe 






125 




Gly 


Leu 


Trp Asn Arg Gly He 


Ser 






140 




Ala 


Pro 


He Phe Arg Asn Lys 


Pro 






155 


160 


Ser 


Ala 


Met Glu Glu Ala Asn 


Phe 




170 


175 




Tyr 


He 


He Asp Arg Arg Asp 


Ala 


185 




190 




Gin 


Arg 


Ala Leu .Ser Asn Pro 


Lys 






205 




Val 


Val 


Glu Ala Tyr Gly Asp 


Gly 






220 




Lys 


Val 


Lys Asn Val Val Thr 


Gly 






235 


240 


Gly 


Leu 


Phe Phe Ala He, Gly 


His 




250 


255 




Gly 


Gly 


Val Glu Leu Asp Ser 


Asp 


265 




270 




Thr 


Thr 


Gin Thr Ser Val Pro 


Gly 






285 




Asp 


Lys 


Lys Tyr Arg Gin Ala 


He 






300 




Ala 


Ala 


Leu Asp Ala Glu His 


Tyr 






315 


320 


Gly 


Lys 


Ser Asp 





330 



-41- 



WO 02/50289 



PCT/US01/50240 



<210> 42 
<211> 332 
<212> DNA 
<213> E. coli 

<220> 

<221> CDS 

<222> (1) . . . (332) 

<400> 42 

atg age gat aaa att att cac ctg act gac gac agt ttt gac acg gat 48 

Met Ser Asp Lys lie lie His Leu Thr Asp Asp Ser Phe Asp Thr Asp 
15 10 15 

gta etc aaa gcg gac ggg get ate etc gtt gat ttc tgg gca gag tgg 96 
Val Leu Lys Ala Asp Gly Ala He Leu Val Asp Phe Trp Ala Glu Trp 
20 25 30 

tgc ggg ccg tgt aaa atg ate get ccg att ctg gat gaa ate get gac 144 
Cys Gly Pro Cys Lys Met He Ala Pro He Leu Asp Glu He Ala Asp 
35 40 45 

gaa tat cag ggc aaa ttg ace gtt gee aaa ctg aac att gac cag aac 192 
Glu Tyr Gin Gly Lys Leu Thr Val Ala Lys Leu Asn lie Asp Gin Asn 
50 55 60 

cca ggt act gcg cct aaa tat ggc ate cgc ggt att ccg act ctg ctg 240 
Pro Gly Thr Ala Pro Lys Tyr Gly He Arg Gly He Pro Thr Leu Leu 
65 70 75 80 

ctg ttt aaa aac ggc gaa gtg gcg gca ace aaa gta ggc gca ctg tct 288 
Leu Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser 
85 90 95 

aaa ggt cag ttg aaa gag ttt etc gac gee aat ctg gcg taa ta - 332 
Lys Gly Gin Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala * 
100 105 



<210> 43 
<211> 109 
<212> PRT 
<213> E. coli 

<400> 43 

Met Ser Asp Lys He He His Leu Thr Asp Asp Ser Phe Asp Thr Asp 

15 10 15 

Val Leu Lys Ala Asp Gly Ala He Leu Val Asp Phe Trp Ala Glu Trp 

20 25 30 

Cys Gly Pro Cys Lys Met He Ala Pro He Leu Asp Glu He Ala Asp 

35 40 45 

Glu Tyr Gin Gly Lys Leu Thr Val Ala Lys Leu Asn He Asp Gin Asn 

50 55 60 

Pro Gly Thr Ala Pro Lys Tyr Gly He Arg Gly He Pro Thr, Leu Leu 
65 70 ~ 75 80 

Leu Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser 

85 90 95 

Lys Gly Gin Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala 
100 105 



<210> 44 

<211> 966 

<212> DNA 

<213> E. coli 
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<220> 

<221> CDS 

<222> (1) . . . (966) 

<400> 44 

a tg ggc acg acc aaa cac agt aaa ctg ctt ate ctg ggt tea ggc ccg 48 

Met Gly Thr Thr Lys His Ser Lys Leu Leu lie Leu Gly Ser Gly Pro 

15 10 15 

gcg gga tac acc get get gtc tac gcg gcg cgc gec aac ctg caa cct 96 
Ala Gly Tyr Thr Ala Ala Val Tyr Ala Ala Arg Ala Asn Leu Gin Pro 
20 25 30 

gtg ctg att acc ggc atg gaa aaa ggc ggc caa ctg acc acc acc acg 144 
Val Leu lie Thr Gly Met Glu Lys Gly Gly Gin Leu Thr Thr Thr Thr 
35 40 45 

gaa gtg gaa aac tgg cct ggc gat cca aac gat ctg acc ggt ccg tta 192 
Glu Val Glu Asn Trp Pro Gly Asp Pro Asn Asp Leu Thr Gly Pro Leu 
50 55 60 

tta atg gag cgc atg cac gaa cat gec acc aag ttt gaa act gag ate 240 
Leu Met Glu Arg Met His Glu His Ala Thr Lys Phe Glu Thr Glu lie 
65 70 75 80 

att ttt gat cat ate aac aag gtg gat ctg caa aac cgt ccg ttc cgt 288 
He Phe Asp His He Asn Lys Val Asp Leu Gin Asn Arg Pro Phe Arg 
85 90 95 

ctg aat ggc gat aac ggc gaa tac act tgc gac gcg ctg att att gec 336 
Leu Asn Gly Asp Asn Gly Glu Tyr Thr Cys Asp Ala Leu He He Ala 
100 105 110 

acc gga get tct gca cgc tat etc ggc ctg ccc tct gaa gaa gee ttt 384 
Thr Gly Ala Ser Ala Arg Tyr Leu Gly Leu Pro Ser Glu Glu Ala Phe 
115 120 125 

aaa ggc cgt ggg gtt tct get tgt gca acc tgc gac ggt ttc ttc tat 432 
Lys Gly Arg Gly Val Ser Ala Cys Ala Thr Cys Asp Gly Phe Phe Tyr 
130 135 140 

cgc aac cag aaa gtt gcg gtc ate ggc ggc ggc aat acc gcg gtt gaa 480 
Arg Asn Gin Lys Val Ala Val He Gly Gly Gly Asn Thr Ala Val Glu 
145 150 155 160 

gag gcg ttg tat ctg tct aac ate get teg gaa gtg cat ctg att cac 528 
Glu Ala Leu Tyr Leu Ser Asn He Ala Ser Glu Val His Leu He His 
165 170 175 

cgc cgt gac ggt ttc cgc gcg gaa aaa ate etc att aag cgc ctg atg 576 
Arg Arg Asp Gly Phe Arg Ala Glu Lys He Leu He Lys Arg Leu Met 
180 185 190 

gat aaa gtg gag aac ggc aac ate att ctg cac acc aac cgt acg ctg 624 
Asp Lys Val Glu Asn Gly Asn He He Leu His Thr Asn Arg Thr Leu 
195 200 205 

gaa gaa gtg acc ggc gat caa atg ggt gtc act ggc gtt cgt ctg cgc 672 
Glu Glu Val Thr Gly Asp Gin Met Gly Val Thr Gly Val Arg Leu Arg 
210 215 220 

gat acg caa aac age gat aac ate gag tea etc gac gtt gee ggt ctg 720 
Asp Thr Gin Asn Ser Asp Asn He Glu Ser Leu Asp Val Ala Gly Leu 
225 230 235 240 

ttt gtt get ate ggt cac age ccg aat act gcg att ttc gaa ggg cag 768 
Phe Val Ala He Gly His Ser Pro Asn Thr Ala He Phe Glu Gly Gin 
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245 250 255 

ctg gaa ctg gaa aac ggc tac ate aaa gta cag teg ggt att cat ggt 816 
Leu Glu Leu Glu Asn Gly Tyr lie Lys Val Gin Ser Gly lie His Gly 
260 265 270 

aat gec ace cag acc age att cct ggc gtc ttt gec gca ggc gac gtg 864 
Asn Ala Thr Gin Thr Ser lie Pro Gly Val Phe Ala Ala Gly Asp Val 
275 280 285 

atg gat cac att tat cgc cag gec att act teg gec ggt aca ggc tgc 912 
Met Asp His lie Tyr Arg Gin Ala lie Thr Ser Ala Gly Thr Gly Cys 
290 295 300 

atg gca gca ctt gat gcg gaa cgc tac etc gat ggt tta get gac gca 960 
Met Ala Ala Leu Asp Ala Glu Arg Tyr Leu Asp Gly Leu Ala Asp Ala 
305 310 315 320 

aaa taa 966 
Lys * 



<210> 45 


















<211> 321 


















<212> PRT 


















<213> E. coli 
















<400> 45 


















Met 


Gly 


Thr 


Thr 


Lys His 


Ser Lys Leu Leu He 


Leu 


Gly 


Ser 


Gly 


Pro 


1 






5 


10 








15 




Ala 


Gly 


Tyr 


Thr 


Ala Ala 


Val Tyr Ala Ala Arg 


Ala 


Asn 


Leu 


Gin 


Pro 




20 




25 






30 






Val 


Leu 


He 


Thr 


Gly Met 


Glu Lys Gly Gly Gin 


Leu 


Thr 


Thr 


Thr 


Thr 






35 






40 




45 








Glu 


Val 


Glu 


Asn 


Trp Pro 


Gly Asp Pro Asn Asp 


Leu 


Thr 


Gly 


Pro 


Leu 




50 








55 


60 










Leu 


Met 


Glu 


Arg 


Met His 


Glu His Ala Thr Lys 


Phe 


Glu 


Thr 


Glu 


lie 


65 






70 


75 










80 


lie 


Phe 


Asp 


His 


He Asn 


Lys Val Asp Leu Gin 


Asn 


Arg 


Pro 


Phe 


Arg 








85 


90 








95 




Leu 


Asn 


Gly 


Asp 


Asn Gly 


Glu Tyr Thr Cys Asp 


Ala 


Leu 


He 


He 


Ala 








100 




105 






110 






Thr 


Gly 


Ala 


Ser 


Ala Arg 


Tyr Leu Gly Leu Pro 


Ser 


Glu 


Glu 


Ala 


Phe 






115 






120 




125 








Lys 


Gly 


Arg 


Gly 


Val Ser 


Ala Cys Ala Thr Cys 


Asp 


Gly 


Phe 


Phe 


Tyr 




130 








135 


140 










Arg 


Asn 


Gin 


Lys 


Val Ala 


Val He Gly Gly Gly 


Asn 


Thr 


Ala 


Val 


Glu 


145 








150 


155 










160 


Glu 


Ala 


Leu 


Tyr 


Leu Ser 


Asn He Ala Ser Glu 


Val 


His 


Leu 


He 


His 








165 


170 








175 




Arg 


Arg 


Asp 


Gly 


Phe Arg 


Ala Glu Lys He Leu 


He 


Lys 


Arg 


Leu 


Met 








180 




185 






190 






Asp 


Lys 


Val 


Glu 


Asn Gly 


Asn lie He Leu His 


Thr 


Asn 


Arg 


Thr 


Leu 


195 




200 




205 








Glu 


Glu 


Val 


Thr 


Gly Asp 


Gin Met Gly Val Thr 


Gly 


Val 


Arg, 


Leu 


Arg 




210 








215 


220 










Asp 


Thr 


Gin 


Asn 


Ser Asp 


Asn He Glu Ser Leu 


Asp 


Val 


Ala 


Gly 


Leu 


225 








230 


235 










240 


Phe 


Val 


Ala 


He 


Gly His 


Ser Pro Asn Thr Ala 


He 


Phe 


Glu 


Gly 


Gin 










245 


250 








255 




Leu 


Glu 


Leu 


Glu 


Asn Gly 


Tyr He Lys Val Gin 


Ser 


Gly 


lie 


His 


Gly 








260 




265 






270 






Asn 


Ala 


Thr 


Gin 


Thr Ser 


He Pro Gly Val Phe 


Ala 


Ala 


Gly 


Asp 


Val 






275 






280 




285 








Met 


Asp 


His 


He 


Tyr Arg 


Gin Ala He Thr Ser 


Ala 


Gly 


Thr 


Gly 


Cys 




290 








295 


300 
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Met Ala Ala Leu Asp Ala Glu Arg Tyr Leu Asp Gly Leu Ala Asp Ala 
305 310 " 315 320 

Lys 



<210> 46 

<211> 318 

<212> DNA 

<213> Homo Sapien 

<220> 

<221> CDS 

<222> (1) . . . (318) 

<400> 46 

atg gtg aag cag ate gag age aag act get ttt cag gaa gec ttg gac 48 
Met Val Lys Gin lie Glu Ser Lys Thr Ala Phe Gin Glu Ala Leu Asp 
15 10 15 

get gca ggt gat aaa ctt gta gta gtt gac ttc tea gee acg tgg tgt 96 
Ala Ala Gly Asp Lys Leu Val Val Val Asp Phe Ser Ala Thr Trp Cys 
20 25 30 

ggg cct tgc aaa atg ate aag cct ttc ttt cat tec etc tct gaa aag 144 
Gly Pro Cys Lys Met lie Lys Pro Phe Phe His Ser Leu Ser Glu Lys 
35 40 45 

tat tec aac gtg ata ttc ctt gaa gta gat gtg gat gac tgt cag gat 192 
Tyr Ser Asn Val lie Phe Leu Glu Val Asp Val Asp Asp Cys Gin Asp 
50 55 60 

gtt get tea gag tgt gaa gtc aaa tgc atg cca aca ttc cag ttt ttt 240 
Val Ala Ser Glu Cys Glu Val Lys Cys Met Pro Thr Phe Gin Phe Phe 
65 70 75 80 

aag aag gga caa aag gtg ggt gaa ttt tct gga gee aat aag gaa aag 288 
Lys Lys Gly Gin Lys Val Gly Glu Phe Ser Gly Ala Asn Lys Glu Lys 
85 90 95 

ctt gaa gee ace att aat gaa tta gtc taa 318 
Leu Glu Ala Thr lie Asn Glu Leu Val * 
100 105 



<210> 47 

<211> 105 

<212> PRT 

<213> Homo Sapien 



<400> 47 



Met Val 


Lys 


Gin 


lie 


Glu Ser Lys 


1 






5 




Ala Ala 


Gly 


Asp 


Lys 


Leu Val Val 






20 






Gly Pro 


Cys 


Lys 


Met 


lie Lys Pro 




35 






40 


Tyr Ser 


Asn 


Val 


lie 


Phe Leu Glu 


50 








55 


Val Ala 


Ser 


Glu 


Cys 


Glu Val Lys 


65 








70 


Lys Lys 


Gly 


Gin 


Lys 


Val Gly Glu 








85 




Leu Glu 


Ala 


Thr 


lie 


Asn Glu Leu 






100 







Thr Ala Phe Gin Glu Ala Leu Asp 

10 15 
Val Asp Phe Ser Ala Thr Trp Cys 
25 30 
Phe Phe His Ser Leu Ser Glu Lys 
45 

Val Asp Val Asp Asp Cys Gin Asp 
60 

Cys Met Pro Thr Phe Gin Phe Phe 

75 80 
Phe Ser Gly Ala Asn Lys Glu Lys 
90 95 

Val 
105 
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<210> 48 

<211> 1494 

<212> DNA 

<213> Homo sapien 

<220> 

<221> CDS 

<222> (1) . . . (1494) 

<400> 48 

atg aac ggc cct gaa gat ctt ccc aag tec tat gac tat gac ctt ate 48 

Met Asn Gly Pro Glu Asp Leu Pro Lys Ser Tyr Asp Tyr Asp Leu lie 

1 5 10 15 

ate att gga ggt ggc tea gga ggt ctg gca get get aag gag cca gec 96 
lie lie Gly Gly Gly Ser Gly Gly Leu Ala Ala Ala Lys Glu Pro Ala 
20 25 30 

caa tat ggc aag aag gtg atg gtc ctg gac ttt ggc act ccc acc cct 144 
Gin Tyr Gly Lys Lys Val Met Val Leu Asp Phe Gly Thr Pro Thr Pro 
35 " 40 45 

ctt gga act aga tgg ggt ctt gga gga aca tgt gtg aat gtg ggt tgc 192 
Leu Gly Thr Arg Trp Gly Leu Gly Gly Thr Cys Val Asn Val Gly Cys 
50 55 60 

ata cct aaa aaa ctg atg cat caa gca get ttg tta gga caa gec ctg 240 
lie Pro Lys Lys Leu Met His Gin Ala Ala Leu Leu Gly Gin Ala Leu 
65 70 75 80 

caa gac tct cga aat tat gga tgg aaa gtc gag gag aca gtt aag cat 288 
Gin Asp Ser Arg Asn Tyr Gly Trp Lys Val Glu Glu Thr Val Lys His 
85 ~ 90 95 

gat tgg gac aga atg ata gaa get gta cag aat cac att ggc tct ttg 336 
Asp Trp Asp Arg Met lie Glu Ala Val Gin Asn His lie Gly Ser Leu 
100 105 110 

aat tgg ggc tac cga gta get ctg egg gag aaa aaa gtc gtc tat gag 384 
Asn Trp Gly Tyr Arg Val Ala Leu Arg Glu Lys Lys Val Val Tyr Glu 
115 120 125 

aat get tat ggg caa ttt att ggt cct cac agg att aag gca aca aat 432 * 
Asn Ala Tyr Gly Gin Phe lie Gly Pro His Arg lie Lys Ala Thr Asn 
130 135 140 

aat aaa ggc aaa gaa aaa att tat tea gca gag aga ttt etc att gec 480 
Asn Lys Gly Lys Glu Lys lie Tyr Ser Ala Glu Arg Phe Leu lie Ala 
145 150 155 160 

act ggt gaa aga cca cgt tac ttg ggc ate cct ggt gac aaa gaa tac 528 
Thr Gly Glu Arg Pro Arg Tyr Leu Gly He Pro Gly Asp Lys Glu Tyr 
165 170 175 

tgc ate age agt gat gat ctt ttc tec ttg cct tac tgc ccg ggt aag 576 
Cys He Ser Ser Asp Asp Leu Phe Ser Leu Pro Tyr Cys Pro Gly Lys 
180 185 190' 

aca ctg gtt gtt gga gca tec tat gtc get ttg gag tgc get gga ttt 624 
Thr Leu Val Val Gly Ala Ser Tyr Val Ala Leu Glu Cys Ala Gly Phe 
195 200 205 

ctt get ggt att ggt tta gac gtc act gtt atg gtt agg tec att ctt 672 
Leu Ala Gly He Gly Leu Asp Val Thr Val Met Val Arg Ser He Leu 
210 215 220 

ctt aga gga ttt gac cag gac atg gec aac aaa att ggt gaa cac atg 720 
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Leu Arg Gly Phe Asp Gin Asp Met Ala Asn Lys lie Gly Glu His Met 
225 ~ " 230 * 235 240 

gaa gaa cat ggc ate aag ttt ata' aga cag ttc gta cca att aaa gtt 768 
Glu Glu His Gly lie Lys Phe lie Arg Gin Phe Val Pro He Lys Val 
245 250 255 

gaa caa att gaa gca ggg aca cca ggc cga etc aga gta gta get cag 816 
Glu Gin He Glu Ala Gly Thr Pro Gly Arg Leu Arg Val Val Ala Gin 
260 265 270 

tec ace aat agt gag gaa ate att gaa gga gaa tat aat acg gtg atg 864 
Ser Thr Asn Ser Glu Glu He He Glu Gly Glu Tyr Asn Thr Val Met 
275 280 285 

ctg gca ata gga aga gat get tgc aca aga aaa att ggc tta gaa acc 912 
Leu Ala He Gly Arg Asp Ala Cys Thr Arg Lys He Gly Leu Glu Thr 
290 295 300 

gta ggg gtg aag ata aat gaa aag act gga aaa ata cct gtc aca gat 960 
Val Gly Val Lys lie Asn Glu Lys Thr Gly Lys He Pro Val Thr Asp 
305 310 315 320 

gaa gaa cag acc aat gtg cct tac ate tat gee att ggc gat ata ttg 1008 
Glu Glu Gin Thr Asn Val Pro Tyr He Tyr Ala He Gly Asp He Leu 
325 330 335 

gag gat aag gtg gag etc acc cca gtt gca ate cag gca gga aga ttg 1056 
Glu Asp Lys Val Glu Leu Thr Pro Val Ala He Gin Ala Gly Arg Leu 
340 345 350 

ctg get cag agg etc tat gca ggt tec act gtc aag tgt gac tat gaa 1104 
Leu Ala Gin Arg Leu Tyr Ala Gly Ser Thr Val Lys Cys Asp Tyr Glu 
355 360 365 

aat gtt cca acc act gta ttt act cct ttg gaa tat ggt get tgt ggc 1152 
Asn Val Pro Thr Thr Val Phe Thr Pro Leu Glu Tyr Gly Ala Cys Gly 
370 375 380 

ctt tct gag gag aaa get gtg gag aag ttt ggg gaa gaa aat att gag 1200 
Leu Ser Glu Glu Lys Ala Val Glu Lys Phe Gly Glu Glu Asn He Glu 
385 390 395 400 

gtt tac cat agt tac ttt tgg cca. ttg gaa tgg acg att ccg tea aga 1248 
Val Tyr His Ser Tyr Phe Trp Pro Leu Glu Trp Thr He Pro Ser Arg 
405 410 415 

gat aac aac aaa tgt tat gca aaa ata ate tgt aat act aaa gac aat 1296 
Asp Asn Asn Lys Cys Tyr Ala Lys He He Cys Asn Thr Lys Asp Asn 
420 425 430 

gaa cgt gtt gtg ggc ttt cac gta ctg ggt cca aat get gga gaa gtt 1344 
Glu Arg Val Val Gly Phe His Val Leu Gly Pro Asn Ala Gly Glu Val 
435 440 445 

aca caa ggc ttt gca get gcg etc aaa tgt gga ctg acc aaa, aag cag 1392 
Thr Gin Gly Phe Ala Ala Ala Leu Lys Cys Gly Leu Thr Lys Lys Gin 
450 " 455 460 

ctg gac age aca att gga ate cac cct gtc tgt gca gag gta ttc aca 1440 
Leu Asp Ser Thr He Gly He His Pro Val Cys Ala Glu Val Phe Thr 
465 470 475 480 

aca ttg tct gtg acc aag cgc tct ggg gca age ate etc cag get ggc 1488 
Thr Leu Ser Val Thr Lys Arg Ser Gly Ala Ser He Leu Gin Ala Gly 
485 490 495 
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tgc tga 1494 
Cys * 



<210> 49 

<211> 497 

<212> PRT 

<213> Homo sapien 

<400> 49 

Met Asn Gly Pro Glu Asp Leu Pro Lys Ser Tyr Asp Tyr Asp Leu lie 

15 10 15 

He He Gly Gly Gly Ser Gly Gly Leu Ala Ala Ala Lys Glu Pro Ala 

20 25 30 

Gin Tyr Gly Lys Lys Val Met Val Leu Asp Phe Gly Thr Pro Thr Pro 

35 40 45 

Leu Gly Thr Arg Trp Gly Leu Gly Gly Thr Cys Val Asn Val Gly Cys 

50 55 60 

He Pro Lys Lys Leu Met His Gin Ala Ala Leu Leu Gly Gin Ala Leu 
65 70 75 80 

Gin Asp Ser Arg Asn Tyr Gly Trp Lys Val Glu Glu Thr Val Lys His 

85 90 95 

Asp Trp Asp Arg Met He Glu Ala Val Gin Asn His He Gly Ser Leu 

100 105 110 

Asn Trp Gly Tyr Arg Val Ala Leu Arg Glu Lys Lys Val Val Tyr Glu 

115 120 125 

Asn Ala Tyr Gly Gin Phe He Gly Pro His Arg He Lys Ala Thr Asn 

130 135 140 

Asn Lys Gly Lys Glu Lys He Tyr Ser Ala Glu Arg Phe Leu He Ala 
145 150 155 160 

Thr Gly Glu Arg Pro Arg Tyr Leu Gly He Pro Gly Asp Lys Glu Tyr 

165 170 175 

Cys He Ser Ser Asp Asp Leu Phe Ser Leu Pro Tyr Cys Pro Gly Lys 

180 185 190 

Thr Leu Val Val Gly Ala Ser Tyr Val Ala Leu Glu Cys Ala Gly Phe 

195 200 205 

Leu Ala Gly He Gly Leu Asp Val Thr Val Met Val Arg Ser He Leu 

210 215 220 

Leu Arg Gly Phe Asp Gin Asp Met Ala Asn Lys He Gly Glu His Met 
225 230 235 240 

Glu Glu His Gly He Lys Phe He Arg Gin Phe Val Pro He Lys Val 

245 250 255 

Glu Gin He Glu Ala Gly Thr Pro Gly Arg Leu Arg Val Val Ala Gin 

260 265 270 

Ser Thr Asn Ser Glu Glu He He Glu Gly Glu Tyr Asn Thr Val Met 

275 280 285 

Leu Ala He Gly Arg Asp Ala Cys Thr Arg Lys He Gly Leu Glu Thr 

290 295 300 

Val Gly Val Lys He Asn Glu Lys Thr Gly Lys He Pro Val Thr Asp 
305 310 ~ 315 320 

Glu Glu Gin Thr Asn Val Pro Tyr He Tyr Ala He Gly Asp He Leu 

325 330 335 

Glu Asp Lys Val Glu Leu Thr Pro Val Ala He Gin Ala Gly Arg Leu 

340 345 350 

Leu Ala Gin Arg Leu Tyr Ala Gly Ser Thr Val Lys Cys Asp Tyr Glu 

355 360 365 

Asn Val Pro Thr Thr Val Phe Thr Pro Leu Glu Tyr Gly Ala Cys Gly 

370 375 380 

Leu Ser Glu Glu Lys Ala Val Glu Lys Phe Gly Glu Glu Asn He Glu 
385 390 395 400 

Val Tyr His Ser Tyr Phe Trp Pro Leu Glu Trp Thr He Pro Ser Arg 

405 410 415 

Asp Asn Asn Lys Cys Tyr Ala Lys He He Cys Asn Thr Lys Asp Asn 

420 425 430 

Glu Arg Val Val Gly Phe His Val Leu Gly Pro Asn Ala Gly Glu Val 
435 440 445 
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Thr Gin Gly Phe Ala Ala Ala Leu Lys Cys Gly Leu Thr Lys Lys Gin 

450 455 460 

Leu Asp Ser Thr lie Gly lie His Pro Val Cys Ala Glu Val Phe Thr 
465 470 475 480 

Thr Leu Ser Val Thr Lys Arg Ser Gly Ala Ser lie Leu Gin Ala Gly 
485 490 495 

Cys 



<210> 50 
<211> 1377 
<212> DNA 

<213> Mycobacterium leprae 

<220> 
<221> CDS 

<222> (1) . . . (1377) 
<400> 50 

atg aac acc act cct tct gcg cat gag acg at a cac gaa gtg ate gtt 48 
Met Asn Thr Thr Pro Ser Ala His Glu Thr lie His Glu Val lie Val 
1 5 10 15 

att ggc tec ggt cca gca ggc tac act get gee ctg tac gee get cgt 96 
lie Gly Ser Gly Pro Ala Gly Tyr Thr Ala Ala Leu Tyr Ala Ala Arg 
20 '25 30 

gca cag eta aca ccg ctg gta ttt gag ggt acc tea ttc ggc ggc gcg 144 
Ala Gin Leu Thr Pro Leu Val Phe Glu Gly Thr Ser Phe Gly Gly Ala 
35 40 45 • 

ctg atg acc acc acc gag gtg gaa aac tac cca ggt ttt cgc aac ggc 192 
Leu Met Thr Thr Thr Glu Val Glu Asn Tyr Pro Gly Phe Arg Asn Gly 
50 55 60 

ata acc ggc ccg gag ttg atg gac gat atg cgt gaa cag gca ctg cga 240 
He Thr Gly Pro Glu Leu Met Asp Asp Met Arg Glu Gin Ala Leu Arg 
65 70 75 80 

ttc ggc gcg gaa ctg egg acc gaa gac gtc gag teg gta tea ttg cgt 288 
Phe Gly Ala Glu Leu Arg Thr Glu Asp Val Glu Ser Val Ser Leu Arg 
85 90 95 

ggc ccg ate aaa teg gtc gtc acc get gaa gga cag act tat cag gee 336 
Gly Pro He Lys Ser Val Val Thr Ala Glu Gly Gin Thr Tyr Gin Ala 
100 105 110 

cga gee gtc ate etc gee atg ggt acc tec gtg cgt tat eta cag ate 384 
Arg Ala Val He Leu Ala Met Gly Thr Ser Val Arg Tyr Leu Gin He 
115 120 125 

ccc ggc gag caa gaa ttg eta gga cgt ggc gtg agt gca tgc gcg acc 432 
Pro Gly Glu Gin Glu Leu Leu Gly Arg Gly Val Ser Ala Cys Ala Thr 
130 135 140 

tgc gac ggg tec ttt ttc cgc ggc caa gac att gee gtc att ggc ggt 480 
Cys Asp Gly Ser Phe Phe Arg Gly Gin Asp He Ala Val He Gly Gly 
145 150 *" 155 160 

gga gac tea gcg atg gag gaa gee etc ttt ttg acc egg ttc gec cgc 528 
Gly Asp Ser Ala Met Glu Glu Ala Leu Phe Leu Thr Arg Phe Ala Arg 
165 170 175 

age gtc acg etc gtg cac cgc cgc gac gaa ttc cga get tct aag ate 576 
Ser Val Thr Leu Val His Arg Arg Asp Glu Phe Arg Ala Ser Lys He 
180 185 190 
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atg etc ggt cgc gec cgt aac aat gac aag ate aaa ttc ate acc aac 624 
Met Leu Gly Arg Ala Arg Asn Asn Asp Lys lie Lys Phe He Thr Asn 
195 200 205 

cac acc gtg gtc gcg gtg aac ggg tat aca aca gtg acc gga ttg egg 672 
His Thr Val Val Ala Val Asn Gly Tyr Thr Thr Val Thr Gly Leu Arg 
210 215 220 

ttg cgt aac acc aca acg gga gag gaa acc acg eta gta gtg acc ggg 720 
Leu Arg Asn Thr Thr Thr Gly Glu Glu Thr Thr Leu Val Val Thr Gly 
225 230 235 240 

gtt ttt gtt gca att ggc cat gaa cca cgt tec age ctg gtg age gat 768 
Val Phe Val Ala He Gly His Glu Pro Arg Ser Ser Leu Val Ser Asp 
245 250 255 

gtc gtc gac ata gac ccg gat ggc tac gtc ctg gtg aaa gga cgt acg 816 
Val Val Asp He Asp Pro Asp Gly Tyr Val Leu Val Lys Gly Arg Thr 
260 265 270 

acg agt aca teg atg gac ggc gtt ttt gcg gee ggc gac ctg gta gat 864 
Thr Ser Thr Ser Met Asp Gly Val Phe Ala Ala Gly Asp Leu Val Asp 
275 280 285 

cgc acc tac egg cag gcg ate act gec gca ggt agt ggc tgt gee gee 912 
Arg Thr Tyr Arg Gin Ala He Thr Ala Ala Gly Ser Gly Cys Ala Ala 
290 295 "* 300 

gec ate gac gec gaa cgt tgg ttg gcg gag cat gec ggg tea aaa get 960 
Ala He Asp Ala Glu Arg Trp Leu Ala Glu His Ala Gly Ser Lys Ala 
305 310 315 320 

aac gaa aca aca gag gaa act gga gac gtt gac agt acc gac aca acc 1008 
Asn Glu Thr Thr Glu Glu Thr Gly Asp Val Asp Ser Thr Asp Thr Thr 
325 330 335 

gat tgg age act gcg atg act gac gee aag aac gee ggg gtc aca ata 1056 
Asp Trp Ser Thr Ala Met Thr Asp Ala Lys Asn Ala Gly Val Thr He 
340 345 " 350 

gaa gtg acc gat get tec ttt ttc gca gac gtc tta tec agt aat aag 1104 
Glu Val Thr Asp Ala Ser Phe Phe Ala Asp Val Leu Ser Ser Asn Lys 
355 360 365 

cct gtg tta gtt gat ttt tgg gca aca tgg tgt gga ccc tgc aag atg 1152 
Pro Val Leu Val Asp Phe Trp Ala Thr Trp Cys Gly Pro Cys Lys Met 
370 375 380 

gta gcg ccg gta etc gaa gag ate gcg tec gaa caa cga aac cag etc 1200 
Val Ala Pro Val Leu Glu Glu He Ala Ser Glu Gin Arg Asn Gin Leu 
385 390 395 " 400 

act gtc gee aag tta gat gta gac acc aac ccg gaa atg gca cgc gag 1248 
Thr Val Ala Lys Leu Asp Val Asp Thr Asn Pro Glu Met Ala Arg Glu 
405 " 410 415 

ttc cag gtc gtg teg ata ccc aca atg att ctg ttc cag ggt ggc caa 1296 
Phe Gin Val Val Ser He Pro Thr Met He Leu Phe Gin Gly Gly Gin 
420 425 430 

cca gta aaa cgc ate gtt ggc get aag ggc aaa gca gcg tta eta cgt 1344 
Pro Val Lys Arg He Val Gly Ala Lys Gly Lys Ala Ala Leu Leu Arg 
435 440 " 445 

gac ctt tec gac gtg gta cct aac etc aat tag 1377 
Asp Leu Ser Asp Val Val Pro Asn Leu Asn * 
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450 455 



<210> 51 

<211> 458 

<212> PRT 

<213> Mycobacterium leprae 



<400> 51 
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ulU 
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Aia lieu irne 
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loo 






1 / u 
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Leu 


vai 
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Arg 


Arg Asp Glu 


Phe Arg Ala Ser Lys Xle 








loU 








lOJ 


1 Oft 
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Leu 


uiy 


Arg 
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Arg 


Asn 


Asn Asp Lys 


lie Lys Phe Xle Thr Asn 






l!?0 










o n n 
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ms 


Tnr 
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vai 
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vai 


Asn 


Gly Tyr Thr 


Thr Val Thr Gly Leu Arg 




Z 1U 
















_ 
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vaiu biu inr 
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z / u 
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m ir 
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Va 1 "Dr-io 7il =a 

vai irne Aia 


Aia uiy Asp iieu vai Asp 
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Arg 
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325 






330 


335 


Asp 


Trp 


Ser 


Thr 


Ala 


Met 


Thr 


Asp Ala Lys 


Asn Ala Gly Val Thr lie 








340 








345 


350 


Glu 


Val 


Thr 


Asp 


Ala 


Ser 


Phe 


Phe Ala Asp 


Val Leu Ser Ser Asn Lys 






355 










360 


365 


Pro 


Val 


Leu 


Val 


Asp 


Phe 


Trp 


Ala Thr Trp 


Cys Gly Pro Cys Lys Met 




370 










375 




380 


Val 


Ala 


Pro 


Val 


Leu 


Glu 


Glu 


lie Ala Ser 


Glu Gin Arg Asn Gin Leu 


385 










390 






395 400 


Thr 


Val 


Ala 


Lys 


Leu 


Asp 


Val 


Asp Thr Asn 


Pro Glu Met Ala Arg Glu 










405 






. 410 


415 


Phe 


Gin 


Val 


Val 


Ser 


lie 


Pro 


Thr Met He 


Leu Phe Gin Gly Gly Gin 








420 








425 


430 


Pro 


Val 


Lys 


Arg 


lie 


Val 


Gly 


Ala Lys Gly 


Lys Ala Ala Leu Leu Arg 






435 










440 


445 


Asp 


Leu 


Ser 


Asp 


Val 


Val 


Pro 


Asn Leu Asn 





450 455 
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<210> 52 
<211> 178 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 52 

Met Pro Leu Ser Leu Arg Leu Ser Pro Ser Pro Thr Ala Leu Ser Pro 

1*5 10 15 

Thr Thr Gly Gly Phe Gly Pro Ser Arg Lys Gin Cys Arg lie Pro Tyr 

20 25 30 

Ser Gly Val Pro Thr Thr Lys He Gly Phe Cys Ser Leu Asp Ser Arg 

35 40 45 

Lys Arg Gly Asp Ser Ser Val Val Arg Cys Ser Leu Glu Thr Val Asn 

50 55 60 

Val Ser Val Gly Gin Val Thr Glu Val Asp Lys Asp Thr Phe Trp Pro 
65 70 75 80 

He Val Lys Ala Ala Gly Glu Lys Leu Val Val Leu Asp Met Tyr Thr 

85 90 95 

Gin Trp Cys Gly Pro Cys Lys Val He Ala Pro Lys Tyr Lys Ala Leu 

100 105 110 

Ser Glu Lys Tyr Asp Asp Val Val Phe Leu Lys Leu Asp Cys Asn Pro 
115 ~ 

Asp Asn Arg Pro Leu Pro Lys Glu Leu Gly He Arg Val Val Pro Thr 

130 135 140 

Phe Lys He Leu Lys Asp Asn Lys Val Val Lys Glu Val Thr Gly Ala 
145 150 155 160 

Lys Tyr Asp Asp Leu Val Ala Ala He Glu Thr Ala Arg Ser Ala Ala 
165 170 175 

Ser Gly 



<210> 53 
<211> 185 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 53 



Met 


Pro 


Leu 


Ser 


Leu 


Arg 


Leu 


Ala Pro Ser Pro Thr Ser Phe Arg Tyr 


1 








5 




10 15 


Ser 


Pro 


He 


Thr 


Ser 


Thr 


Gly 


Ala Gly Gly Phe Ser Pro Val Lys Gin 








20 








25 30 


His 


Cys 


Arg 


He 


Pro 


Asn 


Ser 


Gly Val Ala Thr Lys He Gly Phe Cys 






35 










40 45 


Ser 


Gly 


Gly 


Gly 


Gly 


Val 


Leu 


Asp Ser Gly Arg Arg He Gly Ser Cys 




50 










55 


60 


Val 


Val 


Arg 


Cys 


Ser 


Leu 


Glu 


Thr Val Asn Val Thr Val Gly Gin Val 


65 










70 




75 80 


Thr 


Glu 


Val 


Asp 


Lys 


Asp 


Thr 


Phe Trp Pro He Val Lys Ala Ala Gly 










85 






90 95 


Asp 


Lys 


lie 


Val 


Val 


Leu 


Asp 


Met Tyr Thr Gin Trp Cys Gly Pro Cys 








100 








105 110 


Lys 


Val 


He 


Ala 


Pro 


Lys 


Tyr 


Lys Glu Leu Ser Glu Lys Tyr Gin Asp 






115 










120 125 


Met 


Val 


Phe 


Leu 


Lys 


Leu 


Asp 


Cys Asn Gin Asp Asn Lys Pro Leu Ala 




130 










135 


140 


Lys 


Glu 


Leu 


Gly 


He 


Arg 


Val 


Val Pro Thr Phe Lys He Leu Lys Asp 


145 










150 




155 160 


Asn 


Lys 


Val 


Val 


Lys 


Glu 


Val 


Thr Gly Ala Lys Tyr Glu Asp Leu Leu 










165 






170 175 


Ala 


Ala 


He 


Glu 


Ala 


Ala 


Arg 


Ser Gly 








180 








185 



<210> 54 
<211> 182 
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<212> PRT 

<213> Brassica napus 
<400> 54 



Met Pro 


Leu 


Ser 


Leu 


Arg 


Leu 


Ala 


Pro 


Ser Pro 


Thr 


Ala Leu Ser Pro 


1 






5 








10 




15 


Thr Thr 


Gly 


Gly 


Phe 


Ser 


Pro 


Ala 


Lys 


Lys Gin 


Cys 


Arg He Pro Ser 






20 










25 






30 


Tyr Ser 


Gly 


Val 


Ala 


Thr 


Thr 


Thr 


Arg 


Arg He Gly 


Leu Cys Ser Leu 




35 










40 








45 


Asp Tyr 


Val 


Lys 


Arg 


Gly 


Asp 


Ser 


Ser 


Val Val 


Arg 


Cys Ser Leu Gin 


50 










55 








60 




Thr Val 


Asn 


Val 


Ser 


Val 


Gly 


Gin 


Val 


Thr Glu 


Val 


Asp Lys Asp Thr 


65 








70 








75 




80 


Phe Trp 


Pro 


He 


Val 


Lys 


Ala 


Ala 


Gly 


Glu Lys 


He 


Val Val Leu Asp 








85 










90 




95 


Met Tyr 


Thr 


Gin 


Trp 


Cys 


Gly 


Pro 


Cys 


Lys Val He 


Ala Pro Lys Tyr 






100 










105 






110 


Lys Ala 


Leu 


Ser 


Glu 


Lys 


Tyr 


Glu 


Asp 


Val Val 


Phe 


Leu Lys Leu Asp 




115 










120 








125 


Cys Asn 


Pro 


Glu 


Asn 


Arcr 


Pro 


Leu 


Ala 


Lys Glu 


Leu 


Gly He Arg Val 


130 










135 








140 




Val Pro 


Thr 


Phe 


Lvs 


He 


Leu 


Lvs 


Asp 


Asn Gin 


Val 


Val Lys Glu Val 


145 








150 








155 




160 


Thr Gly 


Ala 


Lys 


Tyr 


Asp 


Asp 


Leu 


Val 


Ala Ala 


lie 


Glu Thr Ala Arg 








165 










170 




175 


Ser Ala 


Ser 


Ser 


Ser 


Gly 


















180 


















<210> 55 




















<211> 191 




















<212> PRT 




















<213> Mesembryanthemum crystallinum 








<400> 55 




















Met Ala 


Met 


Gin 


Leu 


Ser 


Leu 


Ser 


His 


Gin Ser 


Trp 


Ala Lys Ser Leu 


1 






5 










10 




15 


Ala Ser 


Pro 


He 


Thr 


Ser 


Phe 


Asp 


Pro 


Ala Arg Ser 


Pro Pro Lys Arg 






20 










25 






30 


Val Glu 


Leu 


Gly 


Pro 


Asn 


Cys 


Leu 


Asn 


Gly Gly Ala 


Thr Ala Gly Lys 




35 










40 








45 


Leu Met 


Arg 


Glu 


Lys 


Val 


Gly 


Glu 


Arg 


Met Arg Met 


Ser Gly Arg Ser 


50 










55 








60 




Cys Cys 


Val 


Lys 


Ala 


Ser 


Leu 


Glu 


Thr 


Ala Val Gly 


Ala Glu Ser Glu 


65 








70 








75 




80 


Thr Leu 


Val 


Gly 


Lys 


Val 


Thr 


Glu 


Val 


Asp Lys Asp 


Thr Phe Trp Pro 








85 










90 




95 


He Ala 


Asn 


Gly 


Ala 


Gly 


Asp 


Lys 


Pro 


Val Val 


Leu 


Asp Met Tyr Thr 






100 










105 






110 


Gin Trp 


Cys 


Gly 


Pro 


Cys 


Lys 


Val 


Met 


Ala Pro Lys 


Tyr Gin Glu Leu 




115 










120 








125 


Ala Glu 


Lys 


Leu 


Leu 


Asp 


Val 


Val 


Phe 


Leu Lys 


Leu 


Asp Cys Asn Gin 


130 










135 








140 




Glu Asn 


Lys 


Pro 


Leu 


Ala 


Lys 


Glu 


Leu 


Gly He Arg 


Val Val Pro Thr 


145 








150 








155 




160 


Phe Lys 


He 


Leu 


Lys 


Gly 


Gly 


Lys 


He 


Val Asp Glu 


Val Thr Gly Ala 








165 










170 




175 


Lys Phe 


Asp 


Lys 


Leu 


Val 


Ala 


Ala 


He 


Glu Ala Ala 


Arg Ser Ser 




180 










185 






190 



<210> 56 
<211> 182 
<212> PRT 

<213> Pisum sativum 
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<400> 56 
Met Ala Leu 
1 

Phe Asp Ser 

Thr Thr Ser 
35 

Gin Arg Leu 
50 

Leu Glu Thr 
65 

Asn Lys Asp 

Val Val Leu 

Ala Pro Leu 
115 

Leu Lys Leu 

130 
Gly lie Lys 
145 

Val Lys Glu 
Asp Thr Val 



Asn Leu Cys 
5 

Ala Ser Ser 
20 

Phe Ser Ser 

Ser Leu Arg 

Ala Gly Pro 
70 

Thr Phe Trp 
85 

Asp Met Phe 
100 

Tyr Glu Glu 

Asp Cys Asn 

Val Val Pro 
150 

Val Thr Gly 

165 
Arg Ser Ser 
180 



Thr Ser 

Ser Lys 

Ser lie 

40 
Arg Ser 
55 

Thr Val 

Pro lie 

Thr Lys 

Leu Ser 
120 
Gin Asp 
135 

Thr Phe 
Ala Lys 



Pro Lys 

10 
Pro Ser 
25 

Leu Cys 

lie Ser 

Thr Val 

Val Asn 

90 
Trp Cys 
105 

Gin Lys 

Asn Lys 

Lys lie 

Phe Asp 
170 



Trp lie Gly Thr 
Leu Ala 
Ser Lys 



Val Ser 

60 
Gly Lys 
75 

Ala Ala 

Gly Pro 

Tyr Leu 

Ser Leu 
140 
Leu Lys 
155 

Asp Leu 



Ser Ser 

30 
Arg Val 
45 

Val Arg 



Val Thr 

Gly Asp 

Cys Lys 
110 
Asp Val 
125 

Ala Lys 
Asp Asn 
Val Ala 



Thr Val 
15 

Phe Ser 

Gly Leu 

Ser Ser 

Glu Val 

80 
Lys Thr 
95 

Val lie 

Val Phe 

Glu Leu 

Lys lie 
160 
Ala lie 
175 



<210> 57 
<211> 190 
<212> PRT 

<213> Spinacia oleracea 
<400> 57 

Met Ala Leu His Leu Ser Leu Ser His Gin Ser Trp Thr Ser Pro Ala 

1 5 10 15 

His Pro He Thr Ser Ser Asp Pro Thr Arg Ser Ser Val Pro Gly Thr 

20 25 30 

Gly Leu Ser Arg Arg Val Asp Phe Leu Gly Ser Cys Lys He Asn Gly 

35 ~ 40 ~ 45 

Val Phe Val Val Lys Arg Lys Asp Arg Arg Arg Met Arg Gly Gly Glu 

50 55 60 

Val Arg Ala Ser Met Glu Gin Ala Leu Gly Thr Gin Glu Met Glu Ala 
65 70 75 80 

He Val Gly Lys Val Thr Glu Val Asn Lys Asp Thr Phe Trp Pro He 

85 90 95 

Val Lys Ala Ala Gly Asp Lys Pro Val Val Leu Asp Met Phe Thr Gin 

100 105 *" 110 

Trp Cys Gly Pro Cys Lys Ala Met Ala Pro Lys Tyr Glu Lys Leu Ala 

115 120 125 

Glu Glu Tyr Leu Asp Val He Phe Leu Lys Leu Asp Cys Asn Gin Glu 

130 135 140 

Asn Lys Thr Leu Ala Lys Glu Leu Gly He Arg Val Val Pro Thr Phe 
145 150 155 160 

Lys He Leu Lys Glu Asn Ser Val Val Gly Glu Val Thr Gly Ala Lys 

165 170 175 

Tyr Asp Lys Leu Leu Glu Ala He Gin Ala Ala Arg Ser Ser, 
180 185 190 



<210> 58 
<211> 106 
<212> PRT 
<213> Anabaena 

<400> 58 

Ser Ala Ala Ala Gin Val Thr Asp Ser Thr Phe Lys Gin Glu Val Leu 
1 5 10 15 
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Asp Ser Asp 


vai 
20 


i?ro 


Val Leu Val 


Pro Cys Arg 




Veil 


Aia irro vai 


35 






40 


Glu Gly Lys 


JLJLe 




vai vai Dys 


50 






55 


Val Ala Ser 


Gin 


Tyr 


Gly lie Arg 


65 






70 


Lys Gly Gly 


Gin 


Lys 


Val AspV^Iet 






85 


\ 


Thr Lqu Ser 


Gin 


Thr 


Leu Glu Lys 


t 


100 







Asp Phe Trp Ala Pro Trp Cys Gly 
25 30 
Val Asp Glu lie Ala Gin Gin Tyr 
45 

Val Asn Thr Asp Glu Asn Pro Gin 
60 

Ser He Pro Thr Leu Met He Phe 

75 80 
Val Val Gly Ala Val Pro Lys Thr 

90 95 
His Leu 
105 



<210> 59 
<211> 179 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 59 

Met Ala Ala Tyr Thr Cys Thr Ser Arg Pro Pro He Ser He Arg Ser 

1 * 5 10 15 

Glu Met Arg He Ala Ser Ser Pro Thr Gly Ser Phe Ser Thr Arg Gin 

20 25 30 

Met Phe Ser Val Leu Pro Glu Ser Ser Gly Leu Arg Thr Arg Val Ser 

35 40 45 

Leu Ser Ser Leu Ser Lys Asn Ser Arg Val Ser Arg Leu Arg Arg Gly 

50 m 55 60 

Val He Cys Glu Ala Gin Asp Thr Ala Thr Gly He Pro Val Val Asn 
65 70 75 80 

Asp Ser Thr Trp Asp Ser Leu Val Leu Lys Ala Asp Glu Pro Val Phe 

85 90 95 

Val Asp Phe Trp Ala Pro Trp Cys Gly Pro Cys Lys Met He Asp Pro 

100 * 105 " 110 

He Val Asn Glu Leu Ala Gin Lys Tyr Ala Gly Gin Phe Lys Phe Tyr 

115 120 125 

Lys Leu Asn Thr Asp Glu Ser Pro Ala Thr Pro Gly Gin Tyr Gly Val 

130 135 140 

Arg Ser He Pro Thr He Met He Phe Val Asn Gly Glu Lys Lys Asp 
145 150 155 160 

Thr He He Gly Ala Val Ser Lys Asp Thr Leu Ala Thr Ser He Asn 
165 170 175 

Lys Phe Leu 



<210> 60 
<211> 186 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 60 



Met 


Ala 


Ala 


Phe 


Thr 


Cys 


Thr 


Ser 


1 








5 






Glu 


Thr 


Arg 


He 
20 


Val 


Ser 


Ser 


Ser 


Arg 


Arg 


Met 


Phe 


Ala 


Val 


Leu 


Pro 




35 










40 


Leu 


Ser 
50 


Leu 


Ser 


Pro 


Ala 


Ser 
55 


Leu 


Ser 


Arg 


Leu 


Arg 


Arg 


Ala 


Val 


Val 


65 










70 






Asp 


He 


Gin 


Val 


Val 
85 


Asn 


Asp 


Ser 


Ala 


Thr 


Gly 


Pro 


Val 


Val 


Val Asp 








100 










Cys 


Lys 


Met 


He 


Asp 


Pro 


Leu 


Val 



Arg Pro Pro He Ser Leu Arg Ser 

10 15 
Pro Ser Ala Ser Ser Leu Ser Ser 
25 30 , 

Glu Ser Ser Gly Leu Arg He Arg 
45 

Thr Ser lie His Gin Pro Arg Val 
60 

Cys Glu Ala Gin Glu Thr Thr Thr 

75 80 
Thr Trp Asp Ser Leu Val Leu Lys 

90 95 
Phe Trp Ala Pro Trp Cys Gly Pro 
105 ' 110 
Asn Asp Leu Ala Gin His Tyr Thr 
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115 120 125 

Gly Lys He Lys Phe Tyr Lys Leu Asn Thr Asp Glu Ser Pro Asn Thr 

130 135 140 - 

Pro Gly Gin Tyr Gly Val Arg Ser He Pro Thr He Met He Phe Val 
145 150 155 160 

Gly Gly Glu Lys Lys Asp Thr He He Gly Ala Val Pro Lys Thr Thr 

165 ~ 170 " 175 

Leu Thr Ser Ser Leu Asp Lys Phe Leu Pro 
180 185 



<210> 61 
<211> 173 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 61 

Met Ala He Ser Ser Ser Ser Ser Ser He Cys Phe Asn Pro Thr Arg 

15 10 15 

Phe His Thr Ala Arg His He Ser Ser Pro Ser Arg Leu Phe Pro Val 

20 25 30 

Thr Ser Phe Ser Pro Arg Ser Leu Arg Phe Ser Asp Arg Arg Ser Leu 

35 40 45 

Leu Ser Ser Ser Ala Ser Arg Leu Arg Leu Ser Pro Leu Cys Val Arg 

50 55 60 

Asp Ser Arg Ala Ala Glu Val Thr Gin Arg Ser Trp Glu Asp Ser Val 
65 70 75 80 

Leu Lys Ser Glu Thr Pro Val Leu Val Glu Phe Tyr Thr Ser Trp Cys 

85 90 95 

Gly Pro Cys Arg Met Val His Arg He He Asp Glu He Ala Gly Asp 

100 105 110 

Tyr Ala Gly Lys Leu Asn Cys Tyr Leu Leu Asn Ala Asp Asn Asp Leu 

115 120 125 

Pro Val Ala Glu Glu Tyr Glu lie Lys Ala Val Pro Val Val Leu Leu 

130 135 140 

Phe Lys Asn Gly Glu Lys Arg Glu Ser He Met Gly Thr Met Pro Lys 
145 150 155 160 

Glu Phe Tyr He Ser Ala He Glu Arg Val Leu Asn Ser 
165 170 



<210> 62 
<211> 193 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 62 



Met 


Ala 


Ser 


Leu 


Leu 


Asp 


Ser Val 


1 








5 






Pro 


He 


Ala 


Ala 


Ser 


Val 


Ser Ser 


Arg 


Arg 


Arg 


20 
He 


Ser 


Pro 


Ala Arg 






35 








40 


Ser 


Ser 


Arg 


Ser 


Leu 


Val 


Thr Gin 




50 










55 


Arg 


Thr 


Arg 


He 


Ala 


Arg 


Gly Gly 


65 










70 




Thr 


Thr 


Ala 


Ala 


Ala 


Val 


Glu Val 










85 






Gin 


Thr 


Lys 


Val 


Leu 


Glu 


Ser Asp 








100 








Ala 


Pro 


Trp 


Cys 


Gly 


Pro 


Cys Arg 






115 








120 


Leu 


Ala 


Lys 


Asp 


Phe 


Ala 


Gly Lys 




130 










135 


Asp 


Glu 


Ser 


Pro 


Asn 


Thr 


Pro Asn 


145 










150 





Thr Val Thr Arg Val Phe Ser Leu 

10 15 
Ser Ser Ala Ala Pro Ser Val Ser 
25 30 
Phe Leu Glu Phe Arg Gly Leu Lys 
45 

Ser Ala Ser Leu Gly Ala Asn Arg 
60 

Arg He Ala Cys Glu Ala Gin Asp 

75 80 
Pro Asn Leu Ser Asp Ser Glu Trp 

90 95 
Val Pro Val Leu Val Glu Phe Trp 
105 110 
Met He His Pro He Val Asp Gin 
125 

Phe Lys Phe Tyr Lys He Asn Thr 
140 

Arg Tyr Gly He Arg Ser Val Pro 
155 160 
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Thr Val lie He Phe Lys Gly Gly Glu Lys Lys Asp Ser He He Gly 

165 170 175 

Ala Val Pro Arg Glu Thr Leu Glu Lys Thr He Glu Arg Phe Leu Val 
180 185 190 

Glu 



<210> 63. 
<211> 177 
<212> PRT 

<213> Brassica napus 
<400> 63 

Met Ala Ala Phe Thr Cys Thr Ser Ser Pro Pro He Ser Leu Arg Ser 

15 10 15 

Glu Met Met He Ala Ser Ser Lys Thr Val Ser Leu Ser Thr Arg Gin 

20 25 30 

Met Phe Ser Val Gly Gly Leu Arg Thr Arg Val Ser Leu Ser Ser Val 

35 40 45 

Ser Lys Asn Ser Arg Ala Ser Arg Leu Arg Arg Gly Gly He He Cys 

50 55 60 

Glu Ala Gin Asp Thr Ala Thr Gly He Pro Met Val Asn Asp Ser Thr 
65 70 75 80 

Trp Glu Ser Leu Val Leu Lys Ala Asp Glu Pro Val Val Val Asp Phe 

85 90 95 

Trp Ala Pro Trp Cys Gly Pro Cys Lys Met He Asp Pro He Val Asn 

100 105 110 

Glu Leu Ala Gin Gin Tyr Thr Gly Lys He Lys Phe Phe Lys Leu Asn 

115 120 ~ 125 

Thr Asp Asp Ser Pro Ala Thr Pro Gly Lys Tyr Gly Val Arg Ser He 

130 135 140 

Pro Thr He Met He Phe Val Lys Gly Glu Lys Lys Asp Thr He He 
145 150 155 160 

Gly Ala Val Pro Lys Thr Thr Leu Ala Thr Ser He Asp Lys Phe Leu 
165 170 175 

Gin 



<210> 64 
<211> 140 
<212> PRT 

<213> Chlamydomonas reinhardtii 



<400> 64 



Met 


Ala 


Leu Val Ala 


Arg Arg 


Ala 


l 




5 




Ala 


Arg 


Pro Ala Phe 


Ala Arg 


Ala 






20 




Arg 


Ala 


Glu Ala Gly 


Ala Val 


Asn 






35 




40 


Leu 


Glu 


Ser Ser Val 


Pro Val 


Leu 




50 




55 




Gly 


Pro 


Cys Arg He 


He Ala 


Pro 


65 






70 




Tyr 


Lys 


Asp Lys Leu 


Lys Cys 


Val 






85 






Asn 


Val 


Ala Ser Glu 


Tyr Gly 


lie 






100 






Phe 


Lys 


Gly Gly Lys 


Lys Cys 


Glu 






115 




120 


Ala 


Thr 


He Val Gin 


Thr Val 


Glu 




130 




135 





Ala Val Pro Ser Ala Arg Ser Ser 

10 15 
Ala Pro Arg Arg Ser Val Val Val 
25 30 
Asp Asp Thr Phe Lys' Asn Val Val 
45 

Val Asp Phe Trp Ala Pro Trp Cys' 
60 

Val Val Asp Glu He Ala Gly Glu 

75 80 
Lys Leu Asn Thr Asp Glu Ser Pro 

90 95 
Arg Ser He Pro Thr He Met Val 
105 110 
Thr He He Gly Ala Val Pro Lys 
125 

Lys Tyr Leu Asn 
140 



<210> 65 
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<211> 167 
<212> PRT 
<213> Zea mays 



<400> 65 



Met Ala 


Met 


Glu Thr 


Cvs 


Phe 


Ara 


1 




5 








Gly Ser 


Lys 


Asn Arcr 


Leu 


Leu 


Val 






20 








A*rcr AT a 


Leu 


Ala Pro 


Leu 


Ser 


Val 


35 








40 


Arg His 


Val 


rvc Gin 


Ser 


Lvs 


Asn 


50 








55 




Ann Glu 


Lys 


Asn Trp 


Asp 


Glv 


Leu 


65 






70 






Leu Val 


Glu 


Phe Trp 


Ala 


Pro 


Trp 






85 








Pro Val 


He 


Asp Glu 


Leu 


Ala 


Lys 






100 








Cys Lys 


Val 


Asn Thr 

4iu x a j.iix 


Asp 


Asp 


Ser 




115 








120 


lie Arg 


Ser 


He Pro 


Thr 


Val 


Leu 


130 








135 




Glu Ser 


Val 


Tl^ Glv 


Ala 


Val 


Pro 


145 




150 






Asp Lys 


Tyr 


He Gly 


Ser 


Ser 








165 








<210> 66 










<211> 172 










<212> PRT 










<213> Oryza 


OCIU J. V CI 








<400> 66 










Met Ala 


Leu 


uIU ±111. 


y b 


XT I1C 




1 




*J 








Gin Pro 


Pro 


OCX OCX 


Glv 


Glv 


OC J_ 






20 








Ala Gly 


Ser 


OCX. VJXU 


Ser 


Lys 


Pro 


35 








40 


Pro Leu 


Arg 


Pro Ala 


Ser 




Phe 


50 








55 




Asp Glu 


Val 


Val Val 


Ala 


Asp 


Glu 


65 






70 






Gly Ser 


Glu 


Ala Pro 


Val 


Leu 


Val 




85 








Pro Cys 


Arg 


Met He 


Ala 


Pro 


Val 


100 








Val Gly 


Lys 


He Lys 


Cys 


Cys 


Lys 




115 








120 


lie Ala 


Thr 


Asn Tyr 


Gly 


He 


Arg 


130 








135 




Lys Asn 


Gly 


Gin TiVS 


xiy a 


Gin 


IJCJL 


145 






150 






Thr Leu 


Ala 


Thr He 


He 


Asp 


Lys 






165 






<210> 67 










<211> 172 










<212> PRT 










<213> Pisum 


sativum 








<400> 67 










Met Ala 


Leu 


Glu Ser 


Leu 


Phe 


Lys 



1 5 





xrp- Aia 


Leu 


xilS 


ax a 


Pro 


Til a 

ax a 




i n 








i ^ 




uxy 


Asn jueu 


vax 


lieu 


Pro 


Ser 


Lys 










•3 U 






ft *T 

vjxy 


Arg vax 


Aia 


inr 


Arg 


Arg 


Pro 
















Aia 


vax Asp 


VaXU 


vax 


vax 


vax 


Til a 

ax a 






oU 










Val 


Met Ala 


Cys 


Glu 


Thr 


Pro 


Val 




75 








80 


Cys 


Gly Pro 


Cys 


Arg 


Met 


He 


Ala 




90 








95 




Asp 


Tyr Ala 


Gly 


Lys 


He 


Thr 


Cys 


105 








XlU 






Pro 


Asn Val 


Ala 


Ser 


Thr 


Tyr 


Gly 








125 








He 


Phe Lys 


Gly 


Gly 


Glu 


Lys 


Lys 






140 










Lys 


Ser Thr 


Leu 


Thr 


Thr 


Leu 


He 




ICC 

155 










XoO 


Ala 


Trp Ala 


Thr 


Leu 


His 


TV 1 — 

Ala 


Pro 




10 








X5 




Arg 


Asp Arg 


Leu 


Leu 


Leu 


Ser 


Gly 


25 








JO 






Arg 


Leu Ser 


Val 


Ala 


Ser 


Pro 


Ser 






A tr 
45 








Ala 


Cys Gin 


Cys 


Ser 


Asn 


Vax 


vax 






60 










Lys 


Asn Trp 


Asp 


Ser 


Met 


vax 


Leu 




75 










Q A 

oU 


Glu 


Phe Trp 


Ala 


Pro 


Trp 


Cys 


Gly 




90 








95 




He 


Asp Glu 


Leu 


Ala 


Lys 


Glu 


Tyr 


105 








110 






Val 


Asn Thr 


Asp 


Asp 


Ser 


Pro 


Asn 








125 








Ser 


He Pro 


Thr 


Val 


Leu 


Met 


Phe 






140 










Val 


He Gly 


Ala 


Val 


Pro 


Lys 


Thr 




155 










160 


Tyr 


Val Ser 


Ser 










170 












Ser 


He His 


Thr 


Lys 


Thr 


Ser 

1 c 


Leu 



10 15 
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Ser 


Ser 


Ser lie 


Val 


Phe lie Phe 






20 






Ser 


Lys 


Ser Arg 


lie 


Gin Glu Ser 






35 




40 


Ser 


Leu 


Val Leu 


Leu 


lie Glu Asn 




50 






55 


Ala 


Val 


Asn Glu 


Val 


Gin Val Val 


65 








70 


Val 


lie 


Gly Ser 


Glu 


Thr Pro Val 








85 




Cys 


Gly 


Pro Cys 


Arg 


Met lie Ala 






100 






Glu 


Tyr 


Ala Gly 


Lys 


lie Lys Cys 






115 




120 


Pro 


Asn 


Thr Ala 


Thr 


Lys Tyr Gly 




130 






135 


Phe 


Phe 


Lys Asn 


Gly 


Glu Arg Lys 


145 








150 


Lys 


Ala 


Thr Leu 


Ser 


Glu Lys Val 



165 



Lys Gly Lys Ala Cys Leu Leu Thr 
25 ' 30 

Phe Ala Glu Leu Asn Ser Phe Thr 
45 

His Val Leu Leu His Ala Arg Glu 
60 

Asn Asp Ser Ser Trp Asp Glu Leu 

75 " 80 

Leu Val Asp Phe Trp Ala Pro Trp 

90 95 
Pro He He Asp Glu Leu Ala Lys 
105 110 
Tyr Lys Leu Asn Thr Asp Glu Ser 
125 

He Arg Ser He Pro Thr Val Leu 
140 

Asp Ser Val He Gly Ala Val Pro 
155 160 
Glu Lys Tyr He 
170 



<210> 68 
<211> 181 
<212> PRT 

<213> Spinacia oleracea 
<400> 68 

Met Ala He Glu Asn Cys Leu Gin Leu Ser Thr Ser Ala Ser Val Gly 

1 5 10 15 

Thr Val Ala Val Lys Ser His Val His His Leu Gin Pro Ser Ser Lys 

20 25 30 

Val Asn Val Pro Thr Phe Arg Gly Leu Lys Arg Ser Phe Pro Ala Leu 

35 40 * ~ 45 

Ser Ser Ser Val Ser Ser Ser Ser Pro Arg Gin Phe Arg Tyr Ser Ser 

50 55 60 

Val Val Cys Lys Ala Ser Glu Ala Val Lys Glu Val Gin Asp Val Asn 
65 70 75 80 

Asp Ser Ser Trp Lys Glu Phe Val Leu Glu Ser Glu Val Pro Val Met 

85 90 95 

Val Asp Phe Trp Ala Pro Trp Cys Gly Pro Cys Lys Leu He Ala Pro 

100 105 110 

Val lie Asp Glu Leu Ala Lys Glu Tyr Ser Gly Lys He Ala Val Tyr 

115 120 125 

Lys Leu Asn Thr Asp Glu Ala Pro Gly lie Ala Thr Gin Tyr Asn He 

130 135 140 

Arg Ser He Pro Thr Val Leu Phe Phe Lys Asn Gly Glu Arg Lys Glu 
145 150 155 ~ " 160 

Ser He He Gly Ala Val Pro Lys Ser Thr Leu Thr Asp Ser He Glu 

165 * 170 175 

Lys Tyr Leu Ser Pro 
180 



<210> 69 
<211> 175 
<212> PRT 

<213> Triticum aestivum 
<400> 69 

Met Ala Leu Glu Thr Cys Leu Arg Gly Trp Ala Leu Tyr Ala Pro Gin 

15 10 15 

Ala Gly He Arg Glu Arg Leu Ser Ser Gly Ser Tyr Ala Pro Ser Arg 

20 25 30 

Pro Arg Thr Ala Ala Pro Ala Val Val Ser Pro Ser Pro Tyr Lys Ser 

35 40 45 

Ala Leu Val Ala Ala Arg Arg Pro Ser Arg Phe Val Cys Lys Cys Lys 
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50 








55 




Asn 


Val 


Val 


Asp Glu 


Val 


He 


Val 


65 








70 






Met 


Val 


He 


Ala Cys 
85 


Glu 


Ser 


Pro 


Trp 


Cys 


Gly 


Pro Cys 
100 


Arg 


Met 


He 


Lys 


Asp 


Tyr 
115 


Val Gly 


Lys 


He 


Lys 
120 


Cys 


Pro 


Asn 


He Ala 


Ser 


Thr 


Tyr 


130 








135 




Leu 


Met 


Phe 


Lys Asp 


Gly 


Glu 


Lys 


145 








150 






Pro 


Lys 


Thr 


Thr Leu 


Cys 


Thr 


He 



165 



60 

Ala Asp Glu Lys Asn Trp Asp Asn , 

75 ~ 80 

Val Leu Val Glu Phe Trp Ala Pro 

. 90 95 
Ala Pro Val He Asp Glu Leu Ala 
105 110 
Cys Cys Lys Val Asn Thr Asp Asp 
125 

Gly He Arg Ser He Pro Thr Val 
140 

Lys Glu Ser Val He Gly Ala Val 
155 160 
He Asp Lys Tyr He Gly Ser 
170 ' 175 



<210> 70 
<211> 106 
<212> PRT 

<213> Anacystis nidulans 



<400> 70 



Ser Val 


Ala 


Ala Ala Val Thr 


Asp 


1 

Glu Ser 


Ser 


5 

He Pro Val Leu 


Val 






20 




Pro Cys 


Arg 


Met Val Ala Pro 


Val 




35 




40 


Ser Asp 


Gin 


Val Lys Val Val 


Lys 


50 




55 




Val Ala 


Ser 


Gin Tyr Gly He 


Arg 


65 




70 




Lys Asp 


Gly 


Gin Arg Val Asp 


Thr 






85 




Thr Leu 


Ala 


Asn Thr Leu Asp 


Lys 



100 



Ala Thr Phe Lys Gin Glu Val Leu 

10 15 
Asp Phe Trp Ala Pro Trp Cys Gly 
25 30 
Val Asp Glu He Ala Gin Gin Tyr 
45 

Val Asn Thr Asp Glu Asn Pro Ser 
60 

Ser He Pro Thr Leu Met He Phe 

75 80 
Val Val Gly Ala Val Pro Lys Thr 

90 95 
His Leu 
105 



<210> 71 
<211> 107 
<212> PRT 

<213> Cyanidium caldarium 
<400> 71 

Met Pro Ser Pro He Gin Val Thr Asp Phe Ser Phe Glu Lys Glu Val 

15 10 15 

Val Asn Ser Glu Lys Leu Val Leu Val Asp Phe Trp Ala Pro Trp Cys 

20 " 25 30 

Gly Pro Cys Arg Met He Ser Pro Val He Asp Glu Leu Ala Gin Glu 

35 40 45 

Tyr Val Glu Gin Val Lys He Val Lys He Asn Thr Asp Glu Asn Pro 

50 55 60 

Ser He Ser Ala Glu Tyr Gly He Arg Ser He Pro Thr Leu Met Leu 
65 70 75 80 

Phe Lys Asp Gly Lys Arg Val Asp Thr Val He Gly Ala Val Pro Lys 

~ 85 90 ' 95 

Ser Thr Leu Thr Asn Ala Leu Lys Lys Tyr Leu 
100 105 



<210> 72 
<211> 102 
<212> PRT 

<213> Cyanidioschyzon merolae 
<400> 72 
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Met 


Leu 


His 


lie 


Asp 


Glu 


Leu 


Thr Phe Glu 


Asn Glu Val Leu Gin Ser 


1 








5 






10 


15 


Glu 


Lys 


Leu 


Val 


Leu 


Val 


Asp 


Phe Trp Ala 


Pro Trp Cys Gly Pro Cys 








20 








25 


30 


Arg 


Met 


lie 


Gly 


Pro 


He 


Leu 


Glu Glu He 


Ala Lys Glu Phe Asn Leu 




35 








40 


45 


Lys 


Val 


Val 


Gin 


Val 


Asn 


Thr 


Asp Glu Asn 


Pro Asn Leu Ala Thr Phe 


50 










55 


60 


Tyr 


Gly 


lie 


Arg 


Ser 


He 


Pro 


Thr Leu Met 


Leu Phe Lys Lys Gly Gin 


65 










70 






75 ^ 80 


Arg 


Val 


Asp 


Thr 


Val 


He 


Gly 


Ala Val Pro 


Lys Ser He Leu He His 










85 






90 


95 


Thr 


lie 


Asn 


Lys 


Tyr 


Leu 














100 













<210> 73 
<211> 109 
<212> PRT 

<213> Griff ithsia pacifica 
<400> 73 



Met 


Ser 


He 


Ser 


Gin 


Val 


He 


Asp Thr Ser 


Phe His Glu Glu Val He 


1 








5 






10 


15 


Asn 


Ser 


Arg 


Gin 


Pro 


Val 


Leu 


Val Asp Phe 


Trp Ala Pro Trp Cys Gly 






20 








25 


30 


Pro 


Cys 


Arg 


Met 


He 


Ala 


Ser 


Thr He Asp 


Glu He Ala His Asp Tyr 






35 










40 


45 


Lys 


Asp 


Lys 


Leu 


Lys 


Val 


Val 


Lys Val Asn 


Thr Asp Gin Asn Pro Thr 




50 










55 




60 


He 


Ala 


Thr 


Glu 


Tyr 


Gly 


He 


Arg Ser He 


Pro Thr Val Met He Phe 


65 








70 




75 80 


He 


Asn 


Gly 


Lys 


Lys 


Val 


Asp 


Thr Val Val 


Gly Ala Val Pro Lys Leu 










85 






90 


95 


Thr 


Leu 


Leu 


Asn 


Thr 


Leu 


Gin 


Lys His Leu 


Lys Ser Thr 








100 








105 





<210> 74 
<211> 107 
<212> PRT 

<213> Porphyra yezoensis 



<400> 74 



Met 


Ser 


Val 


Ser 


Gin 


Val 


Thr 


Asp Ala Ser 


Phe Lys Gin Glu Val lie 


1 








5 






10 


15 


Asn 


Asn 


Asn 


Leu 


Pro 


Val 


Leu 


Val Asp Phe 


Trp Ala Pro Trp Cys Gly 








20 








25 


30 


Pro 


Cys 


Arg 


Met 


Val 


Ser 


Pro 


Val Val Asp 


Glu lie Ala Glu Glu Tyr 




35 










40 


45 


Glu 


Ser 


Ser 


He 


Lys 


Val 


Val 


Lys He Asn 


Thr Asp Asp Asn Pro Thr 




50 










55 




60 


He 


Ala 


Ala 


Glu 


Tyr 


Gly 


lie 


Arg Ser lie 


Pro Thr Leu Met He Phe 


65 








70 




75 80 


Lys 


Ala 


Gly 


Glu 


Arg 


Val 


Asp 


Thr Val He 


Gly Ala Val Pro Lys Ser 










85 






90 


95 


Thr 


Leu 


Ala 


Ser 


Thr 


Leu 


Asn 


Lys Tyr He 


Ser 








100 








105 





<210> 75 
<211> 107 
<212> PRT 

<213> Porphyra purpurea 
<400> 75 

Met Ser Val Ser Gin Val Thr Asp Ala Ser Phe Lys Gin Glu Val He 
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1 
J- 




c 
D 




J-ioIl rlSIl -rtojJ 


Leu 


•tiu 


x/al T.e»n Val 
VaX ijcu VO.J. 




z u 






iriO vjyS ATy 


Mpf- 
rlcL 


v ax 


oci xriO Vol 


3b 








Glu Ser Ser 


lie 


iiys 


Val Val Lys 


DO 








lie Ala Ala 


Glu 


Tyr 


Gly lie Arg 


65 






70 


Lys Ser Gly 


Glu 


Arg 


Val Asp Thr 






85 




Thr Leu Glu 


Ser 


Thr 


Leu Asn Lys 




100 





10 15 
Asp Phe Trp Ala Pro Trp Cys Gly 
25 *" 30 

Val Asp Ala He Ala Glu Glu Tyr 
45 

lie Asn Thr Asp Asp Asn Pro Thr 
60 

Ser He Pro Thr Leu Met He Phe 

75 80 
Val He Gly Ala Val Pro Lys Ser 

90 95 
Tyr He Ser 
105 



<210> 76 
<211> 114 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 76 



Met 


Ala 


Ser 


Glu Glu Gly 


Gin Val 


1 






5 




Trp 


Asn 


Glu 


Gin Leu Gin 


Lys Ala 






20 




Val 


Asp 


Phe 


Thr Ala Ser 


Trp Cys 






35 




40 


Phe 


Phe 


Ala 


Asp Leu Ala 


Lys Lys 




50 






55 


Val 


Asp 


Thr 


Asp Glu Leu 


Lys Ser 


65 






70 




Ala 


Met 


Pro 


Thr Phe Met 


Phe Leu 








85 




Val 


Val 


Gly 


Ala Lys Lys 


Asp Glu 








100 




Leu 


Ala 









He Ala Cys His Thr Val Glu Thr 

10 15 
Asn Glu Ser Lys Thr Leu Val Val 
25 30 
Gly Pro Cys Arg Phe He Ala Pro 
45 

Leu Pro Asn Val Leu Phe Leu Lys 
60 

Val Ala Ser Asp Trp Ala He Gin 

75 80 
Lys Glu Gly Lys He Leu Asp Lys 

90 95 
Leu Gin Ser Thr He Ala Lys His 
105 110 



<210> 77 
<211> 110 
<212> PRT 
<213> Anabaena 

<400> 77 

Ser Lys Gly Val He Thr He Thr Asp Ala Glu Phe Glu Ser Glu Val 

15 10 15 

Leu Lys Ala Glu Gin Pro Val Leu Val Tyr Phe Trp Ala Ser Trp Cys 

20 25 * 30 

Gly Pro Cys Gin Leu Met Ser Pro Leu He Asn Leu Ala Ala Asn Thr 

35 40 45 

Tyr Ser Asp Arg Leu Lys Val Val Lys Leu Glu He Asp Pro Asn Pro 

50 55 60 

Thr Thr Val Lys Lys Tyr Lys Val Glu Gly Val Pro Ala Leu Arg Leu 
65 70 ~ 75 80 

Val Lys Gly Glu Gin He Leu Asp Ser Thr Glu Gly Val He Ser Lys 

85 90 '95 

Asp Lys Leu Leu Ser Phe Leu Asp Thr His Leu Asn Asn Asn 
100 105 110 



<210> 78 
<211> 123 
<212> PRT 

<213> Brassica napus 
<400> 78 
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Ala 


Thr 


Ala 


GIU 


vai 


116 


iro Aia \aXy Glu val 


Tic* 

lie 


Ala 


Cys 


i 
l 








5 








1U 




15 




TT J e 


Tiir 


VcLX 


LjJLU 


Asp 


Trp 


Asn 


Asn 


jjys jjcu iiys Aia Aia 

AO 


Lys 


Glu 


Ser 


Asn 


Lys 


Leu 

o 3 


Tl ck 

lie 


vax 


lie 


Asp 


It lie 


nix Aia vai irp uys 

43 


Pro 


Pro 


Cys 


Arg 


Phe 


He 


Ala 


Pro 


He 


Phe 


Val 


Glu Leu Ala Lys Lys 


His 


Leu Asp 




3U 














c n 








Val 


Val 


Phe 


Phe 


Lys 


Val 


Asp 


Val 


Asp Glu Leu Ala Thr 


Val 


Ala 


Gin 










/U 




/ 3 






80 


Glu 


Phe 


Asp 


Val 


Gin 


Ala 


Met 


Pro 


Thr Phe Val Tyr Met 


Lys Gly Glu 










85 








90 




95 




Glu 


Lys 


Leu 


Asp 
100 


Lys 


Val 


Val 


Gly 


Ala Ala Lys Glu Glu 
105 


He 
110 


Glu 


Ala 


Lys 


Leu 


Leu 
115 


Lys 


His 


Ser 


Gin 


Val 
120 


Ala Ala Ala 









<210> 79 
<211> 126 
<212> PRT 

<213> Nicotiana tabacum 
<400> 79 



Met 


Ala 


Ala 


Asn 


Asp 


Ala 


Thr 


Ser 


Ser Glu Glu Gly Gin 


Val Phe Gly 


1 








5 








10 


15 




Cys 


His 


Lys 


Val 


Glu 


Glu 


Trp 


Asn 


Glu Tyr Phe Lys Lys 


Gly Val 


Glu 








20 










25 


30 




Thr 


Lys 


Lys 


Leu 


Val 


Val 


Val 


Asp 


Phe Thr Ala Ser Trp 


Cys Gly 


Pro 






35 










40 


45 






Cys 


Arg 


Phe 


He 


Ala 


Pro 


He 


Leu 


Ala Asp He Ala Lys 


Lys Met 


Pro 




50 










55 




60 






His 


Val 


He 


Phe 


Leu 


Lys 


Val 


Asp 


Val Asp Glu Leu Lys 


Thr Val 


Ser 


65 










70 




75 




80 


Ala 


Glu 


Trp 


Ser 


Val 


Glu 


Ala 


Met 


Pro Thr Phe Val Phe 


He Lys 


Asp 








85 








90 


95 




Gly 


Lys 


Glu 


Val 


Asp 


Arg 


Val 


Val 


Gly Ala Lys Lys Glu 


Glu Leu 


Gin 








100 










105 


110 




Gin 


Thr 


He 


Val 


Lys 


His 


Ala 


Ala 


Pro Ala Thr Val Thr 


Ala 





115 120 125 



<210> 80 
<211> 133 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 80 



Met 


Gly 


Gly 


Ala 


Leu 


Ser 


Thr 


Val 


Phe Gly Ser Gly Glu 


Asp Ala Thr 


1 








5 








10 


15 


Ala 


Ala 


Gly 


Thr 


Glu 


Ser 


Glu 


Pro 


Ser Arg Val Leu Lys 


Phe Ser Ser 






20 










25 


30 


Ser 


Ala 


Arg 
35 


Trp 


Gin 


Leu 


His 


Phe 
40 


Asn Glu He Lys Glu 
45 


Ser Asn Lys 


Leu 


Leu 


Val 


Val 


Asp 


Phe 


Ser 


Ala 


Ser Trp Cys Gly Pro 


Cys Arg Met 




50 










55 




60 


Asp Val Asp 


He 


Glu 


Pro 


Ala 


He 


His 


Ala 


Met 


Ala Asp Lys Phe Asn 


65 










70 






75 


80 


Phe 


Val 


Lys 


Leu 


Asp 
85 


Val 


Asp 


Glu 


Leu Pro Asp Val Ala 
90 


Lys Glu Phe 
95 


Asn 


Val 


Thr 


Ala 
100 


Met 


Pro 


Thr 


Phe 


Val Leu Val Lys Arg 
105 


Gly Lys Glu 
110 


He 


Glu 


Arg 
115 


He 


He 


Gly 


Ala 


Lys 
120 


Lys Asp Glu Leu Glu 
125 


Lys Lys Val 


Ser 


Lys 
130 


Leu 


Arg 


Ala 
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<210> 81 
<211> 119 
<212> PRT 

<213> Brassica napus 
<400> 81 



Z'ic; l. 
n 


Ala 


nla LrJL 11 


fllli Hlw Pin Tfal 
W7JLU viJLy LrJ.Il VcLX 


X 

Trp 


AJ.E 


T /"IT—, 

vai ii_Ln 


5 

Leu Asp Thr Ala 








lie 


Asp 


Phe Thr 


Ala Ser Trp Cys 






35 


40 


Val 


Phe 


Ala Asp 


Leu Ala Lys Lys 




50 




55 


Lys 


Val 


Asp Val 


Asp Glu Leu Gin 


65 






70 


Glu 


Ala 


Met Pro 


Thr Phe Val Leu 








85 


Lys 


Val 


Val Gly 


Ala Arg Lys Glu 






100 




His 


Thr 


Gly Val 


Ala Thr Ala 






115 





lie Gly Cys His Glu lie Asp Val 

10 15 
Lys Gin Ser Asn Lys Leu lie Val 
25 30 
Pro Pro Cys Arg Met lie Ala Pro 
45 

Phe Met Ser Ser Ala lie Phe Phe 
60 

Asn Val Ala Gin Glu Phe Gly Val 

75 80 
lie Lys Asp Gly Asn Val Val Asp 

90 95 
Asp Leu His Ala Thr He Ala Lys 
105 110 



<210> 82 
<211> 118 
<212> PRT 

<213> Nicotiana tabacum 
<400> 82 



Met 


Ala 


Glu 


Glu 


Gly 


Gin Val He 


1 








5 




Asn 


Glu 


His 


Leu 


Gin 


Lys Gly He 








20 




Asp 


Phe 


Thr 


Ala 


Ser 


Trp Cys Gly 






35 






40 


Tyr 


Ala 


Glu 


Leu 


Ala 


Lys Lys Met 




50 








55 


Asp 


Val 


Asp 


Glu 


Leu 


Lys Ser Val 


65 










70 • 


Met 


Pro 


Thr 


Phe 


Met 


Phe Leu Lys 










85 


Val 


Gly 


Ala 


Lys 


Lys 


Asp Glu Leu 








100 




Ser 


Ser 


Thr 


Ser 


Thr 


Ala 






115 









Gly Val His Thr Val Asp Ala Trp 

10 15 
Asp Asp Lys Lys Leu He Val Val 
25 30 
Pro Cys Lys Phe He Ala Ser Phe 
45 

Pro Thr Val Thr Phe Leu Lys Val 
60 

Ala Thr Asp Trp Ala Val Glu Ala 

75 80 
Glu Gly Lys He Val Asp Lys Val 

90 95 
Gin Gin Thr He Ala Lys His He 
105 110 



<210> 83 
<211> 118 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 83 



Met Ala 


Ala 


Glu 


Gly 


Glu 


Val 


He 


1 






5 








Thr Glu 


Lys 


Leu 


Lys 


Ala 


Ala Asn 






20 








Asp Phe 


Thr 


Ala 


Thr 


Trp 


Cys 


Pro 




35 










40 


Phe Ala 


Asp 


Leu 


Ala 


Lys 


Lys 


His 


50 










55 




Asp Val 


Asp 


Glu 


Leu 


Asn 


Thr 


Val 


65 








70 






Met Pro 


Thr 


Phe 


He 


Phe 


Met 


Lys 








85 






Val Gly 


Ala 


Ala 


Lys 


Glu 


Glu 


He 



Ala Cys His Thr Val Glu Asp Trp 

10 , 15 

Glu Ser Lys Lys Leu He Val He 
25 30 
Pro Cys Arg Phe lie Ala Pro Val 
45 

Leu Asp Val Val Phe Phe Lys Val 
60 

Ala Glu Glu Phe Lys Val Gin Ala 

75 80 
Glu Gly Glu He Lys Glu Thr Val 

90 95 
He Ala Asn Leu Glu Lys His Lys 
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100 

Thr Val Val Ala Ala Ala 
115 



105 



110 



<210> 84 
<211> 125 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 84 



Met 


Ala 


Ala 


Glu 


Glu 


Gly 


Gin Val 


1 








5 




Trp 


Thr 


Val 


Gin 


Leu 


Asp 


Lys Ala 








20 








He 


Asp 


Phe 


Thr 


Ala 


Ser 


Trp Cys 






35 








40 


He 


Phe 


Asn 


Asp 


Leu 


Ala 


Lys Lys 




50 










55 


Lys 


Val 


Asp 


Val 


Asp 


Glu 


Leu Gin 


65 










70 




Glu 


Ala 


Met 


Pro 


Thr 


Phe 


Val Phe 










85 






Lys 


Leu 


Val 


Gly 


Ala 


Asn 


Lys Glu 








100 








His 


Thr 


Gly 


Val 


Thr 


Thr 


Val Val 






115 








120 



He Gly Cys His Thr Asn Asp Val 

10 " 15 
Lys Glu Ser Asn Lys Leu He Val 
25 30 
Pro Pro Cys Arg Met He Ala Pro 
45 

Phe Met Ser Ser Ala He Phe Phe 
60 

Ser Val Ala Lys Glu Phe Gly Val 

75 80 
He Lys Ala Gly Glu Val Val Asp 

90 95 
Asp Leu Gin Ala Lys He Val. Lys 
105 110* 
Asn Gin Phe Glu Ala 
125 



<210> 85 
<211> 118 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 85 

Met Ala Gly Glu Gly Glu Val He Ala Cys His Thr Leu Glu Val Trp 

1 5 10 15 

Asn Glu Lys Val Lys Asp Ala Asn Glu Ser Lys Lys Leu He Val He 

20 25 30 

Asp Phe Thr Ala Ser Trp Cys Pro Pro Cys Arg Phe He Ala Pro Val 

35 40 45 

Phe Ala Glu Met Ala Lys Lys Phe Thr Asn Val Val Phe Phe Lys He 

50 55 60 

Asp Val Asp Glu Leu Gin Ala Val Ala Gin Glu Phe Lys Val Glu Ala 
65 70 75 80 

Met Pro Thr Phe Val Phe Met Lys Glu Gly Asn He He Asp Arg Val 

85 90 95 

Val Gly Ala Ala Lys Asp Glu He Asn Glu Lys Leu Met Lys His Gly 

100 105 110 

Gly Leu Val Ala Ser Ala 
115 



<210> 86 
<211> 123 
<212> PRT 

<213> Brassica rapa 
<400> 86 

Met Ala Ala Thr Ala Glu Leu He Pro Ala Gly Glu Val He Ala Cys 

15 10 15 

His Thr Val Glu Asp Trp Asn Asn Lys Leu Lys Ala Ala Lys Glu Ser 

20 25 30 

Asn Lys Leu He Val He Asp Phe Thr Ala Val Trp Cys Pro Pro Cys 

35 40 45 

Arg Phe He Ala Pro He Phe Val Glu Leu Ala Lys Lys His Leu Asp 
50 55 60 
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Val Val Phe Phe Lys Val Asp Val Asp Glu Leu Ala Thr Val Ala Lys 
65 70 75 80 

Glu Phe Asp Val Gin Ala Met Pro Thr Phe Val Tyr Met Lys Gly Glu 

85 90 95 

Glu Lys Leu Asp Lys Val Val Gly Ala Ala Lys Glu Glu He Glu Ala 

100 105 110 

Lys Leu Leu Lys His Ser Gin Val Ala Ala Ala 
115 120 



<210> 87 
<211> 112 
<212> PRT 

<213> Chlamydomonas reinhardtii 



<400> 87 



Gly Gly Ser 


Val 


He 


Val 


He Asp 


1 




5 






Leu Ala Lys 


Gly 


Lys 


Glu 


Glu His 




20 








Ala Thr Trp 


Cys 


Gly 


Pro 


Cys Lys 


35 








40 


Leu Ser Asn 


Asp 


Tyr 


Ala 


Gly Lys 


50 








55 


Asp Ala Val 


Ala 


Ala 


Val 


Ala Glu 


65 






70 




Thr Phe His 


Val 


Tyr 


Lys 


Asp Gly 






85 






Ala Ser Gin 


Asp 


Lys 


Leu 


Lys Ala 



100 



Ser Lys Ala Ala Trp Asp Ala Gin 

10 15 
Lys Pro He Val Val Asp Phe Thr 
25 30 
Met He Ala Pro Leu Phe Glu Thr 
45 

Val He Phe Leu Lys Val Asp Val 
60 

Ala Ala Gly He Thr Ala Met Pro 

75 80 
Val Lys Ala Asp Asp Leu Val Gly 

90 95 
Leu Val Ala Lys His Ala Ala Ala 
105 110 



<210> 88 
<211> 116 
<212> PRT 

<213> Fagopyrum esculentum 
<400> 88 



Met 


Ala 


Glu 


Glu 


Ala 


Gin Val He 


1 








5 




Asn 


Glu 


Lys 


Phe 
20 


Gin 


Lys Ala Lys 


Asp 


Phe 


Thr 
35 


Ala 


Ser 


Trp Cys Gly 
40 


Val 


Ser 
50 


Glu 


Leu 


Ala 


Lys Lys Phe 
. 55 


Asp 


Val 


Asp 


Asp 


Leu 


Lys Asp Val 


65 










70 


Met 


Pro 


Ser 


Phe 


Val 


He Leu Lys 










85 


Val 


Gly 


Ala 


Arg 
100 


Lys 


Asp Glu Leu 


Pro 


He 


Thr 
115 


Ala 







Ala Cys His Thr Val Gin Glu Trp 

10 15 
Asp Ser Gly Lys Leu He Val He 
25 30 
Pro Cys Arg Val He Thr Pro Tyr 
45 

Pro His Val Ala Phe Phe Lys Val 
60 

Ala Glu Glu Tyr Lys Val Glu Ala 

75 80 
Glu Gly Gin Glu Val Glu Arg He 

90 95 
Leu His Lys He Ala Val His Ala 
105 110 



<210> 89 

<211> 122 

<212> PRT 

<213> Oryza sativa 

<400> 89 

Met Ala Ala Glu Glu Gly Val Val 

1 5 
Phe Asp Ala Gin Met Thr Lys Ala 
20 

He Asp Phe Thr Ala Ser Trp Cys 



He Ala Cys His Asn Lys Asp Glu 

10 15 
Lys Glu Ala Gly Lys Val Val He 
25 30 
Gly Pro Cys Arg Phe He Ala Pro 
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35 








40 


Val Phe Ala 


Glu 


Tyr 


Ala Lys 


Lys Phe 


50 






55 




Val Asp Val 


Asp 


Glu 


Leu Lys 


Glu Val 


65 






70 




Ala Met Pro 


Thr 


Phe 


Leu Phe 


He Lys 






85 






Val Val Gly 


Ala 


Arg 


Lys Asp 


Asp Leu 




100 






105 


Val Gly Ala 


Thr 


Ala 


Ala Ser 


Ala Ser 


115 








120 


<210> 90 










<211> 125 










<212> PRT 










<213> Picea 


mar i ana 






<400> 90 










Met Ala Glu 


Gly 


Asn 


Val Phe 


Ala Cys 


1 




5 




... 


Ser Lys Leu 


Gin 


Glu 


Ala He 


Asp Thr 


20 






25 


Phe Thr Ala 


Thr 


Trp 


Cys Gly 


Pro Cys 


35 








40 


Val Glu Leu 


Ser 


Lys 


Lys Phe 


Pro Glu 


50 






55 




Val Asp Glu 


Leu 


Arg 


Asp Val 


Ala Gin 


65 






70 




Pro Thr Phe 


He 


Phe 


He Lys 


Asp Gly 






85 






Gly Ala Lys 


Lys 


Asp 


Asp Leu 


Glu Arg 




100 






105 


Ala Ala Thr 


Thr 


Thr 


Glu Ala 


Thr Leu 


115 








120 



<210> 91 

<211> 118 

<212> PRT 

<213> Ricinus communis 

<400> 91 



Met 


Ala 


Ala 


Glu 


Glu 


Gly Gin 


Val He 


1 
Trp 


Asn 


Glu 


Gin 


5 
Leu 


Gin Lys 


Gly Asn 








20 






25 


Val 


Asp 


Phe 


Thr 


Ala 


Ser Trp 


Cys Gly 






35 








40 


Phe 


Leu 


Ala 


Glu 


Leu 


Ala Lys 


Lys Leu 




50 








55 




Val 


Asp 


Val 


Asp 


Glu 


Leu Lys 


Thr Val 


65 










70 




Ser 


Met 


Pro 


Thr 


Phe 


Met Phe 


Leu Lys 










85 






Val 


Val 


Gly 


Ala 


Lys 


Lys Asp 


Glu Leu 








100 






105 


Met 


Ala 


Thr 


Ala 


Ser 


Thr 








115 











<210> 92 
<211> 126 
<212> PRT 

<213> triticum aestivum 
<400> 92 



PCT/US01/50240 





45 




Pro Gly Ala 


Val Phe Leu 


Lys 


60 






Ala Glu Lys 


Tyr Asn Val 


Glu 


75 




80 


Asp Gly Ala 


Glu Ala Asp 


Lys 


90 


95 




Gin Asn Tlrr 

wJLii noil j. ill. 


He Val Lvs 


His 




110 




Ala 






His Ser Thr 


- 

Glu Gly Trp 


Arg 


10 


15 




Lys Arg Leu 


Val Ala Val 


Asp 




30 




Arg Val He 


Gly Pro Val 


Phe 


45 




He Phe Phe 


Leu Lys Val 


Asp 


60 






Glu Trp Asp 


Val Glu Ala 


Met 


75 




80 


Lys Ala Val 


Asp Lys Val 


Val 


90 


95 




Lys Val Ala 


Ala Leu Ala 


Ala 


110 




Pro Ala Gin 


Ala 






125 




Gly Cys His 


Thr Val Glu 


Ala 


10 


15 




Asp Thr Lys 


Gly Leu He 


Val 




30 




Pro Cys Arg 


Phe He Ala 


Pro 




45 




Pro Asn Val 


Thr Phe Leu 


Lys 


60 




Ala His Glu 


Trp Ala Val 


Glu 


75 


80 


Glu Gly Lys 


He Met Asp 


Lys 


90 


95 




Gin Gin Thr 


He Ala -Lys 


His 



110 
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Ala 


Ala 


Ser 


Ala 


Ala 


Thr 


Ala 


Thr 


Ala 


1 








5 










Glv 


Glu 


Val 


lie 
20 


Ser 


Val 


His 


Ser 


Leu 
25 


Glu 


Glu 


Ala 
35 


Asn 


Ala 


Ala 


Lys 


Lys 
40 


Leu 


Ser 


50 


Cys 


Glv 


Pro 


Cys 


Arg 
55 


He 


Met 


Ala 


Lys 


Lys 


Phe 


Pro 


Ala 


Ala 


Val 


Phe 


65 








70 








Leu 


Lys 


Pro 


lie 


Ala 


Glu 


Gin 


Phe 


Ser 








85 










Leu 


Phe 


Met 


Lys 


Glu 


Gly 


Asp Val Lys 








100 










105 


Lys 


Glu 


Glu 


Leu 


Thr 


Thr 


Lys Val Gly 






115 










120 





PCT/US01/50240 



Thr Ala Ala Ala Val Gly Ala 
10 15 
Glu Gin Trp Thr Met Gin He 
30 

Val Val He Asp Phe Thr Ala 
45 

Ala Pro He Phe Ala Asp Leu 
60 

Leu Lys Val Asp Val Asp Glu 

75 80 
Val Glu Ala Met Pro Thr Phe 
90 95 
Asp Arg Val Val Gly Ala lie 
110 

Leu His Ala Ala Gin 
125 



<210> 93 
<211> 109 
<212> PRT 

<213> Aspergillus nidulans 



<400> 93 



Gly Ala Ser 


Glu 


His 


Val 


Pro Pro 


1 




5 






Glu Lys Val 


Leu 


Asn 


Ala 


Lys Gly 




20 








Thr Trp Cys 


Gly 


Pro 


Cys 


Lys Ala 


35 








40 


Ala Gin Thr 


Tyr 


Thr 


Asp 


Ala Ser 


50 








55 


Leu Ser Glu 


Val 


Ala 


Ala 


Glu Leu 


65 






70 




Leu Leu Phe 


Lys 


Asp 


Gly 


Gin Lys 






85 






Pro Gly Ala 


Leu 


Glu 


Ala 


Gly He 



100 



He Thr Ser Lys Ala Glu Phe Gin 

10 15 
Phe Val Val Val Asp Cys Phe Ala 
25 30 
He Ala Pro Thr Val Glu Lys Phe 
45 

Phe Tyr Gin He Asp Val Asp Glu 
60 

Gly He Arg Ala Met Pro Thr Phe 

75 80 
Val Ser Asp Val Val Gly Ala Asn 

90 95 
Lys Ala Leu Leu Ala 
105 



<210> 94 
<211> 105 
<212> PRT 

<213> Alicyclobacillus 



<400> 94 



Ala Thr 


Met 


Thr 


Leu 


Thr 


Asp Ala 


1 






5 








Asp Lys 


Pro 


Val 


Leu 


Val 


Asp 


Phe 






20 










Arg Met 


Met 


Ala 


Pro 


Val 


Leu 


Glu 




35 










40 


Lys Val 


Thr 


Val 


Ala 


Lys 


Leu 


Asn 


50 










55 




Ser Gin 


Phe 


Gly 


He 


Met 


Ser 


He 


65 








70 






Gly Arg 


Pro 


Val 


Lys 


Gin 


Leu 


He 








85 








Glu Ala 


Gin 


Leu 


Ala 


Asp 


Val 


Leu 



100 



Asn Phe Gin Gin Ala He Gin Gly 

10 15 
Trp Ala Ala Trp Cys Gly Pro Cys 
25 30 
Glu Phe Ala Glu Ala His Ala Asp 
45 

Val Asp Glu Asn Pro Glu Thr Thr 
60 

Pro Thr Leu He Leu Phe Lys Gly 

75 80 
Gly Tyr Gin Pro Lys Glu' Gin Leu 
90 95 

Gin 
105 



<210> 95 
<211> 91 
<212> PRT 

<213> Archaeoglobus fulgidus 



-68- 



WO 02/50289 



PCT/US01/50240 



<400> 95 

Met Val Met Met Lys Leu Phe Thr Ser Pro Thr Cys Pro Tyr Cys Pro 

15 10 15 

Lys Ala Glu Lys Val Val Ser Lys Val Ala Lys Glu Glu Gly Val Leu 

20 25 30 

Ala He Asn Leu Pro Val Asn Thr Asp Glu Gly Leu Lys Glu Ala Leu 

35 40 45 

Lys Phe Gly He Arg Gly Val Pro Ala Leu Val He Asn Asp Lys Tyr 

50 55 60 

Leu He Leu Gly Val Pro Asp Glu Gly Glu Leu Arg Gin Leu He Arg 
65 70 75 80 

Lys Leu Lys Gly Gly Glu Glu Tyr Gly Ala Ser 
85 90 



<210> 96 
<211> 103 
<212> PRT 

<213> Bacillus subtilis 
<400> 96 



Ala He 


Val 


Lys 


Ala 


Thr 


Asp Gin 


1 






5 






Gly Val 


Val 


Leu 


Ala 


Asp 


Phe Trp 






20 








Met He 


Ala 


Pro 


Val 


Leu 


Glu Glu 




35 








40 


Leu Lys 


He 


Val 


Lys 


He 


Asp Val 


50 










55 


Lys Tyr 


Gly 


Val 


Met 


Ser 


He Pro 


65 








70 




Glu Val 


Val 


Glu 


Thr 


Ser 


Val Gly 








85 




Glu Leu 


Val 


Asn 


Lys 


His 


Leu 



100 



Ser Phe Ser Ala Glu Thr Ser Glu 

10 15 
Ala Pro Trp Cys Gly Pro Cys Lys 
25 30 
Leu Asp Gin Glu Met Gly Asp Lys 
45 

Asp Glu Asn Gin Glu Thr Ala Gly 
60 

Thr Leu Leu Val Leu Lys Asp Gly 

75 80 
Phe Lys Pro Lys Glu Ala Leu Gin 
90 95 



<210> 97 

<211> 87 

<212> PRT 

<213> Bacteriophage T4 

<400> 97 



Met Phe Lys 
1 


Val 


Tyr 
5 


Gly 


Tyr Asp 


Cys Asp Asn 


Ala 


Lys 


Arg 


Leu Leu 




20 








Phe lie Asn 


He 


Met 


Pro 


Glu Lys 


35 








40 


Ala Glu Leu 


Leu 


Thr 


Lys 


Leu Gly 


50 








55 


Met Pro Gin 


Val 


Phe 


Ala 


Pro Asp 


65 






70 


Gin Leu Arg 


Glu 


Tyr 


Phe 


Lys 






85 







Ser Asn He His Lys Cys Val Tyr 

10 15 
Thr Val Lys Lys Gin Pro Phe Glu 
25 30 
Gly Val Phe Asp Asp Glu Lys He 
45 

Arg Asp Thr Gin He Gly Leu Thr 
60 

Gly Ser His He Gly Gly Phe Asp 
75 80 



<210> 98 
<211> 117 
<212> PRT 

<213> Borrelia burgdorferi 
<400> 98 

Met Ala He Ser Leu Thr Glu Glu Asp Phe Val Val Lys Val Phe Asp 

15 10 15 

Tyr Lys Asn Asp Lys Glu Trp Ser Phe Arg Gly Asp Arg Pro Ala He 
20 25 30 
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lie Asp 


Phe 


Tyr 


Ala 


Asn Trp Cys 




35 






40 


He Phe 


Glu 


Lys 


Leu 


Ser Lys 


Lys 


50 










Lys Val 


Asp 


Thr 


Asp 


Lys Glu 


Gin 


65 








70 




Gin Ser 


Leu 


Pro 


Thr 
85 


He Leu 


Phe 


Val Ser 


Val 


Gly 
100 


Phe 


Leu Gin 


Glu 


Asp Phe 


Phe 
115 


Gly 


Phe 







Gly Pro Cys Lys Met Leu Ser Pro 
45 

Tyr Glu Asn Ser He Asp Phe Tyr 
60 

Asp He Ser Ser Ala He Gly Val 

75 80 
He Pro Val Asp Gly Lys Pro Lys 

90 95 
Asp Ala Phe Glu Asn He He Lys 
105 110 



<210> 99 
<211> 108 
<212> PRT 

<213> Buchnera aphidicola 
<400> 99 



Met 


Asn 


Lys 


He 


He 


Glu Leu Thr Asp 


Gin 


Asn Phe Glu Glu Gin Val 


1 






5 




10 


15 


Leu 


Asn 


Ser 


Lys 


Ser 


Phe Phe Leu Val 


Asp 


Phe Trp Ala Gin Trp Cys 








20 




25 




30 


Asn 


Pro 


Cys 


Lys 


He 


Leu Ala Pro He 


Leu 


Glu Glu He Ser Lys Glu 






35 




40 




45 


Tyr 


Ser 


Asn 


Lys 


Val 


He Val Gly Lys 


Leu 


Asn He Glu Glu Asn Pro 


50 








55 




60 


Asn 


Thr 


Ala 


Pro 


Val 


Tyr Ser He Arg 


Ser 


He Pro Thr Leu Leu Leu 


65 










70 




75 80 


Phe 


Asn 


Asn 


Ser 


Glu 


Val Leu Ala Thr 


Lys 


Val Gly Ala Val Ser Lys 










85 




90 


95 


Leu 


Glu 


Leu 


Lys 


Glu 


Phe Leu Asp Glu 


Asn 


He Asn 








100 




105 







<210> 100 
<211> 108 
<212> PRT 
<213> aphidicola 

<400> 100 



Met 


Asn 


Lys 


lie 


He 


Glu Leu Thr Asp 


Gin 


Asn Phe Glu Lys Glu Val 


1 








5 




10 


15- 


Leu 


Glu 


His 


Lys 


Ser 


Phe Val Leu Val 


Asp 


Phe Trp Ala Glu Trp Cys 








20 




25 




30 


Asn 


Pro 


Cys 


Lys 


He 


Leu Ala Pro He 


Leu 


Glu Glu He Ala Gin Glu 






35 




40 




45 


Tyr Phe 


Asn 


Lys 


He 


Lys Val Gly Lys 


Leu 


Asn He Glu Lys Asn Pro 




50 








55 




60 


Asn 


Thr 


Ala 


Pro 


lie 


Tyr Ser He Arg 


Gly 


He Pro Ala Leu Leu Leu 


65 










70 




75 80 


Phe 


His 


Gly 


Arg 


Glu 


Val Leu Ala Thr 


Lys 


Val Gly Ala He Ser Lys 










85 




90 


95 


Leu 


Gin 


Leu 


Lys 


Asp 


Phe Leu Asp Glu 


Asn 


He Lys 








100 




105 







<210> 101 
<211> 108 
<212> PRT 

<213> Chlorobium limicola 
<220> 

<221> VARIANT 

<222> 16, 17, 38, 42, 45, 54, 55, 58, 66, 72, 75, 79, 80, 81, 94, 
99, 103 
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<223> Xaa = Any Amino Acid 
<400> 101 

Ala Gly Lys Tyr Phe Glu Ala Thr Asp Lys Asn Phe Gin Thr Glu Xaa 

15 10 15 

Xaa Asp Ser Asp Lys Ala Val Leu Val Asp Phe Trp Ala Ser Trp Cys 

20 25 30 

Gly Pro Cys Met Met Xaa Gly Pro Val Xaa Glu Gin Xaa Ala Asp Asp 

35 40 45 

Tyr Glu Gly Lys Ala Xaa Xaa Ala Lys Xaa Asn Val Asp Glu Asn Pro 

50 55 60 

Asn Xaa Ala Gly Gin Tyr Gly Xaa Arg Ser Xaa Pro Thr Met Xaa Xaa 
65 70 " 75 80 

Xaa Lys Gly Gly Lys Val Val Asp Gin Met Val Gly Ala Xaa Pro Lys 

85 90 95 

Asn Met Xaa Ala Lys Lys Xaa Asp Glu His lie Gly 
100 105 



<210> 102 
<211> 102 
<212> PRT 

<213> Chlamydia muridarum 
<400> 102 



Met Val 


Gin 


He Val Ser 


Gin Asp 


Asn 


Phe 


Ala Asp Ser lie Ala Ser 


1 




5 




10 


15 


Gly Leu 


Val 


Leu Val Asp 


Phe Phe 


Ala 


Glu 


Trp Cys Gly Pro Cys Lys 






20 




25 




30 


Met Leu 


Thr 


Pro Val Leu 


Glu Ala 


Leu 


Ala 


Ala Glu Leu Pro Tyr Val 




35 




40 






45 


Thr lie 


Leu 


Lys Leu Asp 


He Asp 


Ala 


Ser 


Pro Arg Pro Ala Glu ■ Gin 


50 






55 






60 


Phe Gly 


Val 


Ser Ser He 


Pro Thr 


Leu 


lie 


Leu Phe Lys Asp Gly Lys 


65 




70 








75 80 


Glu Val 


Glu 


Arg Ser Val 


Gly Leu 


Lys Asp 


Lys Asp Ser Leu Val Lys 






85 






90 


95 


Leu lie 


Ser 


Lys His Gin 














100 











<210> 103 
<211> 102 
<212> PRT 

<213> Chlamydia pneumoniae 
<400> 103 

Met Val Lys He He Ser Ser Glu Asn Phe Asp Ser Phe He Ala Ser 

15 10 15 

Gly Leu Val Leu Val Asp Phe Phe Ala Glu Trp Cys Gly Pro Cys Arg 

20 25 30 

Met Leu Thr Pro He Leu Glu Asn Leu Ala Ala Glu Leu Pro His Val 

35 40 45 

Thr He Gly Lys He Asn He Asp Glu Asn Ser Lys Pro Ala Glu Thr 

50 55 60 

Tyr Glu Val Ser Ser He Pro Thr Leu He Leu Phe Lys Asp, Gly Asn 
65 70 75 80 

Glu Val Ala Arg Val Val Gly Leu Lys Asp Lys Glu Phe Leu Thr Asn 

85 90 " 95 

Leu He Asn Lys His Ala 
100 



<210> 104 
<211> 102 
<212> PRT 
<213> Psittaci 
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<400> 104 














Met Val 


Lys 


Val 


Val 


Ser 


Ala 


Glu Asn 


Phe Asn Ser Phe He Ala Thr 


1 




5 








10 15 


Gly Leu 


Val 


Leu 


lie 


Asp 


Phe 


Phe Ala 


Glu Trp Cys Gly Pro Cys Lys 






20 








25 


30 


Met Leu 


Thr 


Pro 


Val 


Leu 


Glu 


Ser Leu 


Glu Ala Glu Val Ser Ser Val 




35 










40 


45 


Leu lie 


Gly 


Lys 


Val 


Asn 


He 


Asp Asp 


His Pro Ala Pro Ala Glu Gin 


50 










55 




60 


Tyr Gly 


Val 


Ser 


Ser 


He 


Pro 


Thr Leu 


He Leu Phe Lys Asp Gly Lys 


65 








70 






75 80 


Glu Val 


Asp 


Arg 


Val 


Val 


Gly 


Leu Lys 


Asp Lys Asp Ser Leu He Arg 








85 








90 95 


Leu lie 


Asn 


Gin 


His 


Ser 












100 













<210> 105 
<211> 102 
<212> PRT 

<213> Chlamydia trachomatis 



<400> 105 



Met Val 


Gin 


Val 


Val 


Ser 


Gin 


Glu Asn 


Phe Ala Asp Ser He Ala Ser 


1 






5 








10 15 


Gly Leu 


Val 


Leu 


He 


Asp 


Phe 


Phe Ala 


Glu Trp Cys Gly Pro Cys Lys 






20 








25 


30 


Met Leu 


Thr 


Pro 


Val 


Leu 


Glu 


Ala Leu 


Ala Ala Glu Leu Pro His Val 




35 










40 


45 


Thr He 


Leu 


Lys 


Val 


Asp 


He 


Asp Ser 


Ser Pro Arg Pro Ala Glu Gin 


50 










55 




60 


Tyr Ser 


Val 


Ser 


Ser 


He 


Pro 


Thr Leu 


He Leu Phe Lys Asp Gly Lys 


65 








70 






75 80 


Glu Val 


Glu 


Arg 


Ser 


Val 


Gly 


Leu Lys 


Asp Lys Asp Ser Leu He Lys 








85 








90 95 


Leu He 


Ser 


Lys 


His 


Gin 












100 













<210> 106 * 
<211> 105 
<212> PRT 

<213> Cornybacterium nephridii 



<400> 106 



Ala Thr 


Val 


Lys 


Val 


Asp 


Asn 


Ser Asn 


Phe Gin Ser Asp Val Leu Gin 


1 






5 








10 15 


Ser Ser 


Glu 


Pro 


Val 


Val 


Val 


Asp Phe 


Trp Ala Glu Trp Cys Gly Pro 






20 








25 


30 


Cys Lys 


Met 


He 


Ala 


Pro 


Ala 


Leu Asp 


Glu He Ala Thr Glu Met Ala 


35 










40 


45 


Gly Gin 


Val 


Lys 


He 


Ala 


Lys 


Val Asn 


He Asp Glu Asn Pro Glu Leu 


50 








55 




60 


Ala Ala 


Gin 


Phe 


Gly 


Val 


Arg 


Ser He 


Pro Thr Leu Leu Met Phe Lys 


65 








70 






75 80 


Asp Gly 


Glu 


Leu 


Ala 


Ala 


Asn 


Met Val 


Gly Ala Ala Pro Lys, Ser Arg 








85 








90 95 


Leu Ala 


Asp 


Trp 


He 


Lys 


Ala 


Ser Ala 








100 








105 





<210> 107 
<211> 107 
<212> PRT 

<213> Cornybacterium nephridii 
<400> 107 
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Ser Ala 


TVi-r 

J- 1 IX 


JL -L 


Val 


Asn 


Thr 


Thy Arti 

•X 111 «0 


Glu Asn Phe» din Ala Asn Val 


1 






c 








10 1R 




IV 1 ss 




J. Ill 


Ova 

Sz X. \J 


Val 
v ctx 


T.^n Val 

■UCU Val 


J\cr\ PVlf^ Tr-rk Til a Ol "V Ttti rHrcj 
nop riiu up nia VJiy up v—yss 






z U 














Cys 


Lys 


x\l Ct 


Tl 


ZVl a 
Aid 


XT J. \J V ct X 


Tif^n C5l ii Oln T.<aii .Q^t" Aqti flln 
ucu vjiu uiu jjcu wci noli vai u 












Art 




TH/V 11 a 


f2l tr 


Lys 


V dX 




Tl 
lie 


Va 1 Ta^g 
Val uyo 


Val 7\ cn Val TVit* .Qat* fxra dl n 
val nop vctl J. ill OCi vyo vjiu 


c n 
DU 










c c 

DO 




bU 


Asp Thr 


Ala 


Val 


Lys 


Tyr 


Asn 


He Arg 


Asn He Pro Ala Leu Leu Met 


65 








70 






75 80 


Phe Lys 


Asp 


Gly 


Glu 


Val 


Val 


Ala Gin 


Gin Val Gly Ala Ala Pro Arg 








85 








90 95 


Ser Lys 


Leu 


Ala 


Ala 


Phe 


He 


Asp Gin 


Asn He 






100 








105 





<210> 108 

<211> 145 

<212> PRT 

<213> Cornybacterium nephridii 



<400> 108 



Met 


He 


He Val Cys 


Ala 


Ser 


Cys 


1 




5 








Glu Lys 


Leu Ala Val 


His 


Pro 


Asn 






20 








Leu 


Pro 


Leu Glu Pro 


He 


Glu 


Leu 






35 






40 


He 


Ser 


Asn Ser Asp 


Leu 


Pro 


Val 




50 






55 




Cys Gly 


Pro Cys Lys 


Met 


Met 


Ala 


65 






70 






Gin 


Asn 


Pro Tyr Val 


Val 


Phe 


Ala 






85 








Arg 


Leu 


Ser Ala Ala 


Phe 


Asn 


Val 






100 








Met 


Asn 


Lys Thr Thr 


Glu 


Val 


Ala 






115 






120 


Leu 


Glu 


Leu Gin Gin 


Trp Leu 


Asp 




130 






135 





Asn 
145 



Gly Ala Lys Asn Arg Val Pro Glu 

10 15 
Cys Gly Gin Cys His Gin Ala Leu 
25 30 
Asn Glu Gin Asn Phe Ser Asn Phe 
45 

Leu He Asp Leu Trp Ala Glu Trp 
60 

Pro His Phe Ala Gin Val Ala Lys 

75 80 
Lys He Asp Thr Glu Ala Asn Pro 

90 95 
Arg Ser He Pro Thr Leu Val Leu 
105 110 
Arg He Ser Gly Ala Leu Arg. Thr 
125 

Gin Gin Leu Gin Gin Gin Gin Gly 
140 



<210> 109 

<211> 107 

<212> PRT 

<213> Chromatium vinosum 
<220> 

<221> VARIANT 

<222> 17, 38, 42, 55, 58, 60, 72, 107 

<223> Xaa = Any Amino Acid 



<400> 109 



Ser Asp 


Ser 


He 


Val 


His Val 


Thr 


Asp 


Asp Ser Phe Glu Glu Glu Val 


1 






5 








10 15 


Xaa Lys 


Ser 


Pro 


Asp 


Pro Val 


Leu 


Val 


Asp Tyr Trp Ala Asp Trp Cys 






20 








25 


30 


Gly Pro 


Cys 


Lys 


Met 


Xaa Ala 


Pro 


Val 


Xaa Asp Glu He Ala Asp Glu 




35 








40 




45 


Tyr Ala 


Gly 


Arg 


Val 


Lys Xaa 


Ala 


Lys 


Xaa Asn Xaa Asp Glu Asn Pro 


50 








55 






60 


Asn Thr 


Pro 


Pro 


Arg 


Tyr Gly 


Xaa 


Arg 


Gly He Pro Thr Leu Met Leu 


65 








70 






75 80 


Phe Arg 


Gly 


Gly 


Glu 


Val Glu 


Ala 


Thr 


Lys Val Gly Ala Val Ser Lys 



85 90 95 
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Ser Gin Leu Thr Ala Phe Leu Asp Ser Asn Xaa 
100 105 



<210> 110 
<211> 107 
<212> PRT 

<213> Clostridium litorale 
<400> 110 

Met Leu Met Leu Asp Lys Asp Thr Phe Lys Thr Glu Val Leu Glu Gly 

1 5 10 15 

Thr Gly Tyr Val Leu Val Asp Tyr Phe Ser Asp Gly Cys Val Pro Cys 

20 25 30 

Lys Ala Leu Met Pro Ala Val Glu Glu Leu Ser Lys Lys Tyr Glu Gly 

35 40 45 

Arg Val Val Phe Ala Lys Leu Asn Thr Thr Gly Ala Arg Arg Leu Ala 

50 55 ^ 60 

lie Ser Gin Lys lie Leu Gly Leu Pro Thr Leu Ser Leu Tyr Lys Asp 
65 70 75 80 

Gly Val Lys Val Asp Glu Val Thr Lys Asp Asp Ala Thr lie Glu Asn 

85 90 95 

He Glu Ala Met Val Glu Glu His He Ser Lys 
100 105 



<210> 111 
<211> 40 
<212> PRT 

<213> Clostridium sporogenes 
<400> 111 

Met Leu Val Leu Asp Lys Lys Thr Phe Glu Glu Glu Val Leu Lys Thr 

1 5 10 15 

Lys Gly Tyr Val Leu Val Asp Tyr Phe Gly Asp Gly Cys Val Pro Cys 

20 25 30 

Glu Ala Leu Met Pro Asp Val Glu 
35 40 



<210> 112 
<211> 33 
<212> PRT 

<213> Clostridium sticklandii 
<400> 112 

Met Phe Glu Leu Asp Lys Asp Thr Phe Glu Thr Glu Val Leu Gin Gly 

1 5 10 15 

Thr Gly Tyr Val Leu Val Asp Phe Trp Ser Glu Gly Cys Glu Pro Cys 
20 25 " 30 

Lys 



<210> 113 
<211> 106 
<212> PRT 

<213> Coprinus comatus 
<400> 113 

Met Val Gin Val He Ser Asn Leu Asp Glu Phe Asn Lys Leu Thr Asn 

15 10 15 

Ser Gly Lys He He He He Asp Phe Trp Ala Thr Trp Cys Gly Pro 

20 25 • 30 

Cys Arg Val He Ser Pro He Phe Glu Lys Phe Ser Glu Lys Tyr Gly 

35 40 45 

Ala Asn Asn He Val Phe Ala Lys Val Asp Val Asp Thr Ala Ser Asp 
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50 55 60 

lie Ser Glu Glu Ala Lys lie Arg Ala Met Pro Thr Phe Gin Val Tyr 
65 70 75 80 

Lys Asp Gly Gin Lys He Asp Glu Leu Val Gly Ala Asn Pro Thr Ala 

85 90 95 

Leu Glu Ser Leu Val Gin Lys Ser Leu Ala 
100 105 



<210> 114 
<211> 105 
<212> PRT 

<213> Dictyostelium discoideum 



<400> 114 



Met Ser 


Asn 


Arg 


Val He His 


Val 


1 






5 




His Leu 


Arg 


Asp 


Glu Arg Val 


Val 






20 






Gly Pro 


Cys 


Arg 


Ala He Ser 


Pro 




35 






40 


Phe He 


Thr 


Phe 


Thr Phe Leu 


His 


50 






55 




His Pro 


He 


Val 


Ser Lys He 


Lys 


65 






70 


Arg Asn 


Gly 


Ser 


Lys Val Ser 


Glu 








85 




Leu Arg 


Ser 


Thr 


Leu Glu Ala 


Asn 






100 







Ser Ser Cys Glu Glu Leu Asp Lys 

10 15 
Val Asp Phe Ser Ala Val Trp Cys 
25 30 
Val Phe Glu Lys Leu Ser Asn Glu 
45 

Val Asp He Asp Lys Leu Asn Val 
60 

Ser Val Pro Thr Phe His Phe Tyr 

75 80 
Phe Ser Gly Ala Ser Glu Ser He 
90 95 

Lys 
105 



<210> 115 
<211> 88 
<212> PRT 

<213> Dictyostelium discoideum 
<400> 115 

Met Ser Arg Val He His He Ser 

1 5 
Leu Gin Ala Glu Arg Leu Val He 
20 

Pro Cys Arg Ala He Ser Pro Val 

35 40 
Val Thr Phe Thr Phe Val His Val 

50 55 
Pro He Val Lys Glu He Arg Ser 
65 70 
Asn Gly Ala Lys Val Ser Glu Phe 
85 



Ser Asn Glu Glu' Leu Asp Lys His 

10 15 
Asp Phe Ser Ala Ala Trp Cys Gly 
25 30 
Phe Glu Lys Leu Ser Asn Glu Phe 
45 

Asp He Asp Lys Leu Ser Gly His 
60 

Val Pro Thr Phe Tyr Phe Tyr Arg 
75 80 



<210> 116 
<211> 88 
<212> PRT 

<213> Dictyostelium discoideum 
<400> 116 

Met Ser Arg Val He His He Ser 

1 5 
Leu Gin Ala Glu Arg Leu Val He 
20 

Pro Cys Arg Ala He Ser Pro Val 

35 40 
Val Thr Phe Thr Phe Val His Val 

50 55 
Pro lie Val Lys Glu He Arg Ser 
65 * 70 



Ser Asn Glu Glu Leu Asp Lys His 

10 15 
Asp Phe Ser Ala Ala Trp Cys Gly 
25 30 
Phe Glu Lys Leu Ser Asn Glu Phe 
45 

Asp He Asp Lys Leu Ser Gly His 
60 

Val Pro Thr Phe Tyr Phe Tyr Arg 
75 80 
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Asn Gly Ala Lys Val Ser Glu Phe 
85 



<210> 117 
<211> 108 
<212> PRT 

<213> E coli, salmonella typhimurium 
<400> 117 

Ser Asp Lys lie He His Leu Thr Asp Asp Ser Phe Asp Thr Asp Val 

15 10 15 

Leu Lys Ala Asp Gly Ala He Leu Val Asp Phe Trp Ala Glu Trp Cys 

20 25 30 

Gly Pro Cys Lys Met He Ala Pro He Leu Asp Glu He Ala Asp Glu 

35 40 45 

Tyr Gin Gly Lys Leu Thr Val Ala Lys Leu Asn He Asp Gin Asn Pro 

50 55 60 

Gly Thr Ala Pro Lys Tyr Gly He Arg Gly He Pro Thr Leu Leu Leu 
65 70 75 80 

Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser Lys 

85 90 95 

Gly Gin Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala 
100 105 



<210> 118 

<211> 105 

<212> PRT 

<213> Synechocystis 



<400> 118 



Met Ala 


Val 


Lys Lys 


Gin Phe Ala 


1 




5 




Ser Pro 


Lys 


Pro Val 


Leu Val Asp 






20 


Cys Gin 


Met 


Met Ala 


Pro lie Leu 


35 




40 


Gin Gin 


He 


Gin Val 


Val Lys He 


50 






55 


Ala Thr 


Gin 


Tyr Gin 


He Gin Ser 


65 




70 


Gin Gly 


Gin 


Pro Val 


His Arg Met 






85 




Leu He 


Gin 


Gin Leu 


Gin Val Phe 






100 





Asn Phe Ala Glu Met Leu Ala Gly 

10 15 
Phe Tyr Ala Thr Trp Cys Gly Pro 
25 30 
Glu Gin Val Gly Ser His Leu Arg 
45 

Asp Thr Asp Lys Tyr Pro Ala He 
60 

Leu Pro Thr Leu Val Leu Phe Lys 

75 80 
Glu Gly Val Gin Gin Ala Ala Gin 
90 95 

Val 
105 



<210> 119 
<211> 139 
<212> PRT 
<213> E . coli 



<400> 119 



Met Asn 


Thr 


Val Cys 


Thr His Cys 


1 




5 




Asp Arg 


He 


Glu Asp 


Ala Ala Lys 






20 




Phe Asp 


Gly 


Glu Val 


He Asn Ala 




35 




40 


Leu Lys 


Asp 


Asp Leu 


Pro Val Val 


50 






55 


Gly Pro 


Cys 


Arg Asn 


Phe Ala Pro 


65 






70 


Arg Ser 


Gly 


Lys Val 


Arg Phe Val 






85 




Glu Leu 


Ser 


Ser Arg 


Phe Gly He 



Gin Ala He Asn Arg He Pro Asp 

10 . 15 

Cys Gly Arg Cys Gly His Asp Leu 
25 30 
Thr Gly Glu Thr Leu Asp Lys Leu 
45 

He Asp Phe Trp Ala Pro Trp Cys 
60 

He Phe Glu Asp Val Ala Gin Glu 

75 80 
Lys Val Asn Thr Glu Ala Glu Arg 

90 95 
Arg Ser He Pro Thr He Met He 
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100 

Phe Lys Asn Gly Gin Val Val Asp 
115 * 120 
Ala Pro Phe Asp Ser Trp Leu Asn 
130 135 



105 110 
Met Leu Asn Gly Ala Val Pro Lys 
125 

Glu Ser Leu 



<210> 120 
<211> 110 
<212> PRT 

<213> Eubacterium acidaminophilum 
<400> 120 



Met Ser 


Ala 


Leu 


Leu 


Val 


Glu 


He 


Asp Lys Asp Gin Phe Gin Ala Glu 


1 






5 








10 15 


Val Leu 


Glu 


Ala 


Glu 


Gly 


Tyr 


Val 


Leu Val Asp Tyr Phe Ser Asp Gly 






20 










25 30 


Cys Val 


Pro 


Cys 


Lys 


Ala 


Leu 


Met 


Pro Asp Val Glu Glu Leu Ala Ala 


35 










40 


45 


Lys Tyr 


Glu 


Gly 


Lys 


Val 


Ala 


Phe 


Arg Lys Phe Asn Thr Ser Ser Ala 


50 










55 




60 


Arg Arg 


Leu 


Ala 


lie 


Ser 


Gin 


Lys 


He Leu Gly Leu Pro Thr He Thr 


65 








70 






75 80 


Leu Tyr 


Lys 


Gly 


Gly 


Gin 


Lys 


Val 


Glu Glu Val Thr Lys Asp Asp Ala 








85 








90 95 


Thr Arg 


Glu 


Asn 


He 


Asp 


Ala 


Met 


He Ala Lys His Val Gly 



100 105 110 



<210> 121 
<211> 107 
<212> PRT 

<213> Haemophilus influenzae 
<400> 121 



Met 


Ser 


Glu 


Val 


Leu 


His 


He 


Asn 


Asp Ala Asp Phe Glu Ser Val 


Val 


1 








5 








10 15 




Val 


Asn 


Ser 


Asp 
20 


He 


Pro 


He 


Leu 


Leu Asp Phe Trp Ala Pro Trp 
25 30 


Cys 


Gly Pro 


Cys 


Lys 


Met 


He 


Ala 


Pro 


Val Leu Asp Glu Leu Ala Pro 


Glu 






35 










40 


45 




Phe 


Ala 
50 


Gly 


Lys 


Val 


Lys 


He 
55 


Val 


Lys Met Asn Val Asp Asp Asn 
60 


Gin 


Ala 


Thr 


Pro 


Ala 


Gin 


Phe 


Gly 


Val 


Arg Ser He Pro Thr Leu Leu 


Leu 


65 










70 




75 


80 


He 


Lys 


Asn 


Gly 


Gin 


Val 


Val 


Ala 


Thr Gin Val Gly Ala Leu Pro 


Lys 








85 








90 95 




Thr 


Gin 


Leu 


Ala 
100 


Asn 


Phe 


He 


Asn 


Gin His He 
105 





<210> 122 
<211> 167 
<212> PRT 

<213> Haemophilus influenzae 
<400> 122 

Met Lys lie Lys Lys Leu Leu Lys Asn Gly Leu Ser Leu Phe Leu Thr 

15 10 15 

Phe He Val He Thr Ser lie Leu Asp Phe Val Arg Arg Pro Val Val 

20 25 30 

Pro Glu Glu He Asn Lys lie Thr Leu Gin Asp Leu Gin Gly Asn Thr 

35 40 45 

Phe Ser Leu Glu Ser Leu Asp Gin Asn Lys Pro Thr Leu Leu Tyr Phe 

50 55 ~ 60 

Trp Gly Thr Trp Cys Gly Tyr Cys Arg Tyr Thr Ser Pro Ala He Asn 
65 70 75 80 
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Ser Leu Ala Lys Glu Gly Tyr Gin 
85 

Gly Asn Glu Ala Asp Val Asn Asp 
100 

Phe Thr Thr Val Asn Asp Pro Lys 
115 120 
lie Asn Val Thr Pro Thr lie Val 

130 135 
Leu Val Thr Thr Gly Leu Thr Ser 
145 150 
Phe Phe Ala Glu Phe Phe Gly 
165 



Val Val Ser Val Ala Leu Arg Ser 

90 95 
Tyr Leu Ser Lys Asn Asp Tyr His 
105 110 
Gly Glu Phe Ala Glu Arg Trp Gin 
125 

Leu Leu Ser Lys Gly Lys Met Asp 
140 

Tyr Trp Gly Leu Lys Val Arg Leu 
155 160 



<210> 123 
<211> 106 
<212> PRT 

<213> Helicobacter pylori 
<400> 123 



Met 


Ser 


His 


Tyr 


lie 


Glu 


Leu 


Thr Glu Glu Asn Phe Glu Ser Thr He 


1 






5 






10 15 


Lys 


Lys 


Gly 


Val 
20 


Ala 


Leu 


Val 


Asp Phe Trp Ala Pro Trp Cys Gly Pro 
25 30 


Cys 


Lys 


Met 
35 


Leu 


Ser 


Pro 


Val 


He Asp Glu Leu Ala Ser Glu Tyr Glu 
40 45 


Gly 


Lys 


Ala 


Lys 


He 


Cys 


Lys 


Val Asn Thr Asp Glu Gin Glu Glu Leu 


50 










55 


60 


Ser 


Ala 


Lys 


Phe 


Gly 


He 


Arg 


Ser He Pro Thr Leu Leu Phe Thr Lys 


65 










70 




75 80 


Asp 


Gly 


Glu 


Val 


Val 
85 


His 


Gin 


Leu Val Gly Val Gin Thr Lys Val Ala 
90 95 


Leu 


Lys 


Glu 


Gin 
100 


Leu 


Asn 


Lys 


Leu Leu Gly 
105 



<210> 124 
<211> 103 
<212> PRT 

<213> Listeria monocytogenes 
<400> 124 



Met Val 


Lys 


Glu 


He 


Thr 


Asp 


Ala 


1 




5 






Gly Leu 


Val 


Leu 


Thr 


Asp 


Phe 


Trp 






20 








Met Val 


Ala 


Pro 


Val 


Leu 


Glu 


Glu 




35 










40 


Leu Lys 


He 


Val 


Lys 


Met 


Asp 


Val 


50 










55 




Ser Phe 


Gly 


Val 


Met 


Ser 


He 


Pro 


65 






70 






Glu Val 


Val 


Glu 


Thr 


He 


He 


Gly 








85 






Glu Val 


He 


Asn 


Lys 


Tyr 


Val 








100 









Thr Phe Glu Gin Glu Thr Ser Glu 

10 15 
Ala Thr Trp Cys Gly Pro Cys Arg 
25 30 
He Gin Glu Glu Arg Gly Glu Ala 
45 

Asp Glu Asn Pro Glu Thr Pro Gly 
60 

Thr Leu Leu He Lys Lys Asp Gly 

75 80 
Tyr Arg Pro Lys Glu Glu Leu Asp 
90 95 



<210> 125 
<211> 85 
<212> PRT 

<213> Methoanococus jannaschii 
<400> 125 

Met Ser Lys Val Lys He Glu Leu Phe Thr Ser Pro Met Cys Pro His 

1 5 10 ^15 

Cys Pro Ala Ala Lys Arg Val Val Glu Glu Val Ala Asn Glu Met Pro 
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20 25 30 

Asp Ala Val Glu Val Glu Tyr He Asn Val Met Glu Asn Pro Gin Lys 

35 40 45 

Ala Met Glu Tyr Gly He Met Ala Val Pro Thr He Val He Asn Gly 

50 55 60 

Asp Val Glu Phe He Gly Ala Pro Thr Lys Glu Ala lieu Val Glu Ala 
65 70 75 80 

He Lys Lys Arg Leu 
85 



<210> 126 

<211> 102 

<212> PRT 

<213> Mycoplasma genitalium 

<400> 126 



Met Val 


Thr 


Glu 


He 


Arg 


Ser 


Leu 


Lys Gin Leu Glu Glu 


He Phe 


Ser 


1 






5 








10 


15 




Ala Lys 


Lys 


Asn 


Val 


lie 


Val Asp 


Phe Trp Ala Ala Trp 


Cys Gly Pro 






20 










25 


30 




Cys Lys 


Leu 


Thr 


Ser 


Pro 


Glu 


Phe 


Gin Lys Ala Ala Asp 


Glu Phe 


Ser 


35 










40 


45 






Asp Ala 


Gin 


Phe 


Val 


Lys 


Val 


Asn 


Val Asp Asp His Thr 


Asp lie 


Ala 


50 










55 




60 






Ala Ala 


Tyr 


Asn 


He 


Thr 


Ser 


Leu 


Pro Thr He Val Val 


Phe Glu 


Asn 


65 






70 






75 




80 


Gly Val 


Glu 


Lys 


Lys 
85 


Arg 


Ala 


lie 


Gly Phe Met Pro Lys 
90 


Thr Lys 
95 


He 


He Asp 


Leu 


Phe 
100 


Asn 


Asn 













<210> 127 
<211> 458 
<212> PRT 

<213> mycobacterium leprae 



<400> 127 



Met 


Asn 


Thr 


Thr 


Pro 


Ser 


Ala His 


Glu Thr He His Glu 


Val He Val 


1 








5 






10 


15 


He 


Gly 


Ser 


Gly 


Pro 


Ala 


Gly Tyr 


Thr Ala Ala Leu Tyr 


Ala Ala Arg 








20 








25 


30 


Ala 


Gin 


Leu 


Thr 


Pro 


Leu 


Val Phe 


Glu Gly Thr Ser Phe 


Gly Gly Ala 






35 








40 


45 




Leu 


Met 


Thr 


Thr 


Thr 


Glu 


Val Glu 


Asn Tyr Pro Gly Phe 


Arg Asn Gly 




50 










55 


60 




He 


Thr 


Gly 


Pro 


Glu 


Leu 


Met Asp 


Asp Met Arg Glu Gin 


Ala Leu Arg 


65 










70 




75 


80 


Phe 


Gly 


Ala 


Glu 


Leu 


Arg 


Thr Glu 


Asp Val Glu Ser Val 


Ser Leu Arg 










85 






90 


95 


Gly 


Pro 


He 


Lys 


Ser 


Val 


Val Thr 


Ala Glu Gly Gin Thr 


Tyr Gin Ala 








100 








105 


110 


Arg 


Ala 


Val 


He 


Leu 


Ala 


Met Gly 


Thr Ser Val Arg Tyr 


Leu Gin He 






115 








120 


125 




Pro 


Gly 


Glu 


Gin 


Glu 


Leu 


Leu Gly 


Arg Gly Val Ser Ala 


Cys Ala Thr 




130 










135 


140 




Cys 


Asp 


Gly 


Ser 


Phe 


Phe 


Arg Gly 


Gin Asp He Ala Val 


He Gly Gly 


145 










150 




155 


160 


Gly 


Asp 


Ser 


Ala 


Met 


Glu 


Glu Ala 


Leu Phe Leu Thr Arg 


Phe Ala Arg 










165 






170 


175 


Ser 


Val 


Thr 


Leu 


Val 


His 


Arg Arg 


Asp Glu Phe Arg Ala 


Ser Lys He 








180 








185 


190 


Met 


Leu 


Gly 


Arg 


Ala 


Arg 


Asn Asn 


Asp Lys He Lys Phe 


He Thr Asn 






195 








200 


205 




His 


Thr 


Val 


Val 


Ala 


Val 


Asn Gly 


Tyr Thr Thr Val Thr 


Gly Leu Arg 




210 










215 


220 
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Leu 


ArQ 


Asn Thr 


225 




Val 


Phe 


Val Ala 


Val 


Val 


Asp He 






260 


Thr 


Ser 


Thr Ser 






275 




Thr 


Tyr Arq 




290 




Ala 


He 


Asp Ala 


305 






Asn 


Glu 


Thr Thr 


Asp 


Tro 


Ser Thr 






340 


Glu 


Val 


Thr Asp 






355 


Pro 


Val 


Leu Val 




370 




Val 


Ala 


Pro Val 


385 






Thr 


Val 


Ala Lys 


Phe 


Gin 


Val Val 






420 


Pro 


Val 


Lys Arg 






435 


Asp 


Leu 


Ser Asp 




450 





Thr Thr 
230 
He Gly 
245 

Asp Pro 

Met Asp 

Gin Ala 

Glu Arg 
310 
Glu Glu 
325 

Ala Met 

Ala Ser 

Asp Phe 

Leu Glu 
390 
Leu Asp 
405 

Ser He 
He Val 
Val Val 



Gly Glu Glu Thr 
His Glu 
Asp Gly 



Gly Val 
280 
He Thr 
295 

Trp Leu 

Thr Gly 

Thr Asp 

Phe Phe 
360 
Trp Ala 
375 

Glu He 

Val Asp 

Pro Thr 

Gly Ala 
440 
Pro Asn 
455 



Pro Arg 
250 
Tyr Val 
265 

Phe Ala 



Ala Ala 

Ala Glu 

Asp Val 
330 
Ala Lys 
345 

Ala Asp 

Thr Trp 

Ala Ser 

Thr Asn 
410 
Met He 
425 

Lys Gly 
Leu Asn 



Thr Leu Val 
235 

Ser Ser Leu 

Leu Val Lys 

Ala Gly Asp 
285 

Gly Ser Gly 

300 
His Ala Gly 
315 

Asp Ser Thr 

Asn Ala Gly 

Val Leu Ser 
365 

Cys Gly Pro 

380 
Glu Gin Arg 
395 

Pro Glu Met 

Leu Phe Gin 

Lys Ala Ala 
445 



Val Thr Gly 
240 

Val Ser Asp 

255. 
Gly Arg Thr 
270 

Leu Val Asp 

Cys Ala Ala 

Ser Lys Ala 
320 

Asp Thr Thr 

335 
Val Thr He 
350 

Ser Asn Lys 

Cys Lys Met 

Asn Gin Leu 
400 

Ala Arg Glu 

415 
Gly Gly Gin 
430 

Leu Leu Arg 



<210> 128 

<211> 102 

<212> PRT 

<213> Mycoplasma pneumoniae 

<400> 128 



Met 


Val 


Thr 


Glu 


He 


Lys 


Ser Leu Lys Gin 


Leu Gly Glu Leu Phe Ala 


1 








5 




10 


15 


Ser 


Asn 


Asn 


Lys 


Val 


He 


He Asp Phe Trp 


Ala Glu Trp Cys Gly Pro 








20 






25 


30 


Cys 


Lys 


He 


Thr 


Gly 


Pro 


Glu Phe Ala Lys 


Ala Ala Ser Glu Val Ser 






35 








40 


45 


Thr 


Val 


Ala 


Phe 


Ala 


Lys 


Val Asn Val Asp 


Glu Gin Thr Asp He Ala 




50 










55 


60 


Ala 


Ala 


Tyr 


Lys 


lie 


Thr 


Ser Leu Pro Thr 


He Val Leu Phe Glu Lys 


65 










70 




75 80 


Gly Gin 


Glu 


Lys 


His 


Arg 


Ala He Gly Phe 


Met Pro Lys Ala Lys He 










85 




90 


95 


Val 


Gin 


Leu 


Val 


Ser 


Gin 












100 











<210> 129 
<211> 112 
<212> PRT 

<213> Mycobacterium smegmatis 
<400> 129 

Met Ser Glu Asp Ser Ala Thr Val Ala Val Thr Asp Asp Ser Phe Ser 

1 5 10 15 

Thr Asp Val Leu Gly Ser Ser Lys Pro Val Leu Val Asp Phe Trp Ala 

20 25 30 

Thr Trp Cys Gly Pro Cys Lys Met Val Ala Pro Val Leu Glu Glu He 

35 40 45 

Ala Ala Glu Lys Gly Asp Gin Leu Thr Val Ala Lys He Asp Val Asp 
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50 55 60 

Val Asp Ala Asn Pro Ala Thr Ala Arg Asp Phe Gin Val Val Ser lie 
65 ~ 70 75 80 

Pro Thr Met lie Leu Phe Lys Asp Gly Ala Pro Val Lys Arg He Val 

85 90 95 

Gly Ala Lys Gly Lys Ala Ala Leu Leu Arg Glu Leu Ser Asp Ala Leu 
100 105 ~ 110 



<210> 130 
<211> 115 
<212> PRT 

<213> Mycobacterium tuberculosis 
<400> 130 



Thr 


Asp 


Ser 


Glu 


Lys 


Ser 


Ala 


Thr 


He Lys Val Thr Asp Ala 


Ser Phe 


1 








5 








10 


15 


Ala 


Thr 


Asp 


Val 
20 


Leu 


Ser 


Ser 


Asn 


Lys Pro Val Leu Val Asp 
25 30 


Phe Trp 


Ala 


Thr 


Trp 
35 


Cys 


Gly 


Pro 


Cys 


Lys 
40 


Met Val Ala Pro Val Leu 
45 


Glu Glu 


He 


Ala 
50 


Thr 


Glu 


Arg 


Ala 


Thr 
55 


Asp 


Leu Thr Val Ala Lys Leu 
60 


Asp Val 


Asp 


Thr 


Asn 


Pro 


Glu 


Thr 


Ala 


Arg 


Asn Phe Gin Val Val Ser 


lie Pro 


65 










70 






75 


80 


Thr 


Leu 


lie 


Leu 


Phe 
85 


Lys 


Asp 


Gly 


Gin Pro Val Lys Arg He 
90 


Val Gly 
95 


Ala 


Lys 


Gly 


Lys 
100 


Ala 


Ala 


Leu 


Leu 


Arg Glu Leu Ser Asp Val 
105 110 


Val Pro 


Asn 


Leu 


Asn 
115 

















<210> 131 
<211> 127 
<212> PRT 

<213> Neurospora crassa 
<400> 131 



Met 


Ser 


Asp 


Gly 


Val 


Lys 


His 


He 


Asn Ser Ala Gin Glu Phe 


Ala Asn 


1 






5 








10 


15 


Leu 


Leu 


Asn 


Thr 


Thr 


Gin 


Tyr 


Val 


Val Ala Asp Phe Tyr Ala 


Asp Trp 








20 










25 30 




Cys 


Gly 


Pro 


Cys 


Lys 


Ala 


He 


Ala 


Pro Met Tyr Ala Gin Phe 


Ala Lys 






35 










40 


45 




Thr 


Phe 


Ser 


He 


Pro 


Asn 


Phe 


Leu 


Ala Phe Ala Lys He Asn 


Val Asp 




50 










55 




60 




Ser 


Val 


Gin 


Gin 


Val 


Ala 


Gin 


His 


Tyr Arg Val Ser Ala Met 


Pro Thr 


65 










70 






75 


80 


Phe 


Leu 


Phe 


Phe 


Lys 


Asn 


Gly 


Lys 


Gin Val Ala Val Asn Gly 


Ser Val 










85 




90 


95 


Met 


He 


Gin 


Gly 


Ala 


Asp 


Val 


Asn 


Ser Leu Arg Ala Ala Ala 


Glu Lys 








100 










105 110 




Met 


Gly 


Arg 


Leu 


Ala 


Lys 


Glu 


Lys 


Ala Ala Ala Ala Gly Ser 


Ser 






115 










120 


125 





<210> 132 
<211> 106 
<212> PRT 

<213> Penicillium chrysogenum 
<400> 132 

Met Gly Val Thr Pro He Lys Ser Val Ala Glu Tyr Lys Glu 

1 5 10 

Thr Asp Ala Thr Gly Pro Val Val Val Asp Phe His Ala Thr 
20 25 30 



Lys Val 
15 

Trp Cys 
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Gly Pro Cys Lys Ala He Ala Pro 

35 40 
His Thr Gly He Gin Phe Tyr Lys 

50 55 
Val Ala Ala Ser Asn Gly Val Ser 
65 70 ' 

Lys Gly Gly Glu Arg Asn Glu Glu 
85 

He Gin Ala Gly Val Lys Ala He 
100 



Ala Leu Glu Lys Leu Ser Glu Thr 
45 

Val Asp Val Asp Glu Leu Ser Glu 
60 

Ala Met Pro Thr Phe His Phe Tyr 

75 80 
Val Lys Gly Ala Asn Pro Ala Ala 

90 95 
Leu Glu 
105 



<210> 133 
<211> 108 
<212> PRT 

<213> Pseudomonas aeruginosa 
<400> 133 



Met 


Ser 


Glu 


His 


He 


Val 


Asn 


Val Thr Asp Ala 


Ser Phe Glu Gin Asp 


1 








5 






10 


15 


Val 


Leu 


Lys 


Ala 


Asp 


Gly 


Pro 


Val Leu Val Asp 


Tyr Trp Ala Glu Trp 








20 








25 


30 


Cys 


Gly 


Pro 


Cys 


Lys 


Met 


He 


Ala Pro Val Leu 


Asp Glu Val Ala Arg 






35 










40 


45 


Asp 


Tyr 


Gin 


Gly 


Lys 


Leu 


Lys 


Val Cys Lys Leu 


Asn He Asp Glu Asn 




50 










55 




60 


Gin 


Asp 


Thr 


Pro 


Pro 


Lys 


Tyr 


Gly Val Arg Gly 


He Pro Thr Leu Met 


65 










70 




75 


80 


Leu 


Phe 


Lys 


Asp 


Gly 


Asn 


Val 


Glu Ala Thr Lys 


Val Gly Ala Leu Ser 










85 






90 


95 


Lys 


Ser 


Gin 


Leu 


Ala 


Ala 


Phe 


Leu Asp Ala Asn 


He 



100 105 



<210> 134 
<211> 104 
<212> PRT 

<213> Rhodospir ilium rub rum 
<220> 

<221> VARIANT 
<222> 21, 35 

<223> Xaa = Any Amino Acid 
<400> 134 



Met Lys 


Gin 


Val 


Ser 


Asp 


Ala 


Ser Phe Glu Glu 


Asp Val Leu Lys Ala 


1 






5 




10 


15 


Asp Gly 


Pro 


Asn 


Xaa 


Val 


Asp 


Phe Trp Ala Glu 


Trp Cys Gly Pro Cys 






20 








25 


30 


Arg Gin 


Xaa 


Ala 


Pro 


Ala 


Leu 


Glu Glu Leu Ala 


Thr Ala Leu Gly Asp 




35 










40 


45 


Lys Val 


Thr 


Val 


Ala 


Lys 


He 


Asn He Asp Glu 


Asn Pro Gin Thr Pro 


50 








55 


60 


Ser Lys 


Tyr 


Gly 


Val 


Arg 


Gly 


He Pro Thr Leu 


Met He Phe Lys Asp 


65 








70 




75 


80 


Gly Gin 


Val 


Ala 


Ala 


Thr 


Lys 


He Gly Ala Leu 


Pro Lys Thr Lys Leu 








85 






90 


95 


Phe Glu 


Trp 


Val 


Glu 


Ala 


Ser 


Val 






100 













<210> 135 
<211> 105 
<212> PRT 

<213> Rhodobacter sphaeroides 
<400> 135 
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Ser Thr 


Val 


Pro 


Val 


Thr 


Asp 


Ala 


Thr 


Phe Asp Thr Glu Val Arg Lys 


1 






5 










10 15 


Ser Asp 


Val 


Pro 
20 


Val 


Val 


Val 


Asp 


Phe 
25 


Trp Ala Glu Trp Cys Gly Pro 
30 


Cys Arg 


Gin 


lie 


Gly 


Pro 


Ala 


Leu 


Glu 


Glu Leu Ser Lys Glu Tyr Ala 


35 










40 




45 


Gly Lys 


Val 


Lys 


lie 


Val 


Lys 


Val 


Asn 


Val Asp Glu Asn Pro Glu Ser 


50 










55 






60 


Pro Ala 


Met 


Leu 


Gly 


Val 


Arg 


Gly 


He 


Pro Ala Leu Phe Leu Phe Lys 


65 






70 




75 80 


Asn Gly 


Gin 


Val 


Val 


Ser 


Asn 


Lys 


Val 


Gly Ala Ala Pro Lys Ala Ala 






85 










90 95 


Leu Ala 


Thr 


Trp 
100 


He 


Ala 


Ser 


Ala 


Leu 
105 





<210> 136 

<211> 130 

<212> PRT 

<213> Rickettsia prowazekii 



<400> 136 



Met 


Ser 


Cys 


Tyr 


Asn 


Glu 


He 


Thr 


Thr 


Leu Leu Glu Phe Asp Ser Asn 


1 








5 










10 15 . 


Asp 


lie 


Asn 


Thr 


Thr 


Gin 


Arg 


He 


Asn 


Met Val Asn Asn Val Thr Asp 






20 










25 


30 


Ser 


Ser 


Phe 


Lys 


Asn 


Glu 


Val 


Leu 


Glu 


Ser Asp Leu Pro Val Met Val 






35 








40 




45 


Asp 


Phe 


Trp 


Ala 


Glu 


Trp 


Cys 


Gly 


Pro 


Cys Lys Met Leu He Pro He 




50 










55 






60 


He 


Asp 


Glu 


He 


Ser 


Lys 


Glu 


Leu 


Gin 


Asp Lys Val Lys Val Leu Lys 


65 










70 








75 80 


Met 


Asn 


He 


Asp 


Glu 


Asn 


Pro 


Lys 


Thr 


Pro Ser Glu Tyr Gly He Arg 








85 










90 95 


Ser 


lie 


Pro 


Thr 


He 


Met 


Leu 


Phe 


Lys 


Asn Gly Glu Gin Lys Asp Thr 








100 










105 


110 


Lys 


He 


Gly 


Leu 


Gin 


Gin 


Lys 


Asn 


Ser 


Leu Leu Asp Trp He Asn Lys 



115 120 125 



Ser He 
130 



<210> 137 

<211> 106 

<212> PRT 

<213> Streptomyces aureofaciens 

<400> 137 



Gly Ala 


Thr 


Val 


Lys 


Val 


Thr 


Asn 


Ala 


Thr Phe Lys Ser Asp Val Leu 


1 






5 










10 15 


Glu Ser 


Asp 


Lys 


Pro 


Val 


Leu 


Val 


His 


Phe Glu Gly Pro Trp Cys Gly 






20 










25 


30 


Pro Cys 


Lys 


Met 


Val 


Ala 


Pro 


Val 


Leu 


Asp Glu He Ala Asn Glu Tyr 


35 










40 




45 


Glu Gly 


Lys 


Val 


Lys 


Val 


Ala 


Lys 


Val 


Asn Thr Asp Glu Asn Pro Gin 


50 










55 






60 


Leu Ala 


Ser 


Gin 


Tyr 


Gly 


Val 


Arg 


Ser 


He Pro Thr Arg Leu Met Phe 


65 








70 








75 ~ 80 


Lys Gly 


Gly 


Glu 


Val 


Ala 


Ala 


Asn 


Met 


Val Gly Ala Ala Pro Lys Thr 








85 










90 95 


Arg Leu 


Ala 


Ala 


Phe 


Leu 


Asp 


Ala 


Ser 


Leu 



100 105 



<210> 138 
<211> 110 
<212> PRT 
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<213> Streptomyces coelicolor 
<400> 138 



Met 


Ala 


Gly 


Thr 


Leu 


Lys 


His 


Val 


Thr Asp Asp Ser Phe Glu Gin Asp 


1 








5 








10 15 


Val 


Leu 


Lys 


Asn 


Asp 


Lys 


Pro 


Val 


Leu Val Asp Phe Trp Ala Ala Trp 








20 






25 30 


Cys 


Gly 


Pro 


Cys 


Arg 


Gin 


He 


Ala 


Pro Ser Leu Glu Ala He Ala Ala 






35 






40 


45 


Glu 


Tyr 


Gly 


Asp 


Lys 


He 


Glu 


He 


Val Lys Leu Asn He Asp Glu Asn 




50 










55 




60 


Pro 


Gly 


Thr 


Ala 


Ala 


Lys 


Tyr 


Gly 


Val Met Ser He Pro Thr Leu Asn 


65 








70 


75 80 


Val 


Tyr 


Gin 


Gly 


Gly 


Glu 


Val 


Ala 


Lys Thr He Val Gly Ala Lys Pro 










85 








90 95 


Lys 


Ala 


Ala 


lie 


Val 


Arg 


Asp 


Leu 


Glu Asp Phe He Ala Asp 



100 105 110 



<210> 139 
<211> 107 
<212> PRT 

<213> Streptomyces clavuligerus 
<400> 139 



Met Ala 


Gly 


Val 


Leu 


Lys 


Asn 


Val 


Thr Asp Asp Thr Phe Glu Ala Asp 


1 






5 








10 15 


Val Leu 


Lys 


Ser 


Glu 


Lys 


Pro 


Val 


Leu Val Asp Phe Trp Ala Glu Trp 






20 








25 30 


Cys Gly 


Pro 


Cys 


Arg 


Gin 


He 


Ala 


Pro Ser Leu Glu Ala He Thr Glu 




35 






40 


45 


His Gly 


Gly 


Gin 


He 


Glu 


He 


Val 


Lys Leu Asn He Asp Gin Asn Pro 


50 








55 




60 


Ala Thr 


Ala 


Ala 


Lys 


Tyr 


Gly 


Val 


Met Ser He Pro Thr Leu Asn Val 


65 








70 






75 80 


Tyr Gin 


Gly 


Gly 


Glu 
85 


Val 


Val 


Lys 


Thr He Val Gly Ala Lys Pro Lys 
90 95 


Ala Ala 


Leu 


Leu 
100 


Arg 


Pro 


Gly 


Pro 


Val Pro Arg 
105 



<210> 140 
<211> 106 
<212> PRT 

<213> Synechocystis 
<400> 140 



Ser Ala 


Thr 


Pro 


Gin 


Val 


Ser 


Asp 


Ala Ser Phe Lys Glu Asp Val Leu 


1 






5 






10 15 


Asp Ser 


Glu 


Leu 


Pro 


Val 


Leu 


Val 


Asp Phe Trp Ala Pro Trp Cys Gly 






20 










25 30 


Pro Cys 


Arg 


Met 


Val 


Ala 


Pro 


Val 


Val Asp Glu He Ser Gin Gin Tyr 




35 










40 


45 


Glu Gly 


Lys 


Val 


Lys 


Val 


Val 


Lys 


Leu Asn Thr Asp Glu Asn Pro Asn 


50 










55 




60 


Thr Ala 


Ser 


Gin 


Tyr 


Gly 


He 


Arg 


Ser He Pro Thr Leu Met • He Phe 


65 








70 




75 80 


Lys Gly 


Gly 


Gin 


Arg 


Val 


Asp 


Met 


Val Val Gly Ala Val Pro Lys Thr 








85 








90 95 


Thr Leu 


Ala 


Ser 


Thr 


Leu 


Glu 


Lys 


Tyr Leu 






100 








105 



<210> 141 
<211> 109 
<212> PRT 

<213> Synechocystis 
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<400> 141 



Met Ser 


Leu 


Leu 


Glu 


lie 


Thr 


Asp 


1 






5 








Gly Gin 


Thr 


Lys 


Pro 


Val 


Leu 


Val 






20 










Pro Cys 


Arg 


Leu 


Met 


Ala 


Pro 


Ala 




35 










40 


Gly Asp 


Lys 


Leu 


Lys 


Val 


Leu 


Lys 


50 










55 




Ala Val 


Ala 


Gin 


Cys 


Lys 


Val 


Glu 


65 








70 






Lys Asn 


Asn 


Glu 


Leu 


Val 


Met 


Thr 








85 








Lys Leu 


Leu 


Glu 


Leu 


Leu 


Lys 


Glu 



100 



Ala Glu Phe Glu Gin Glu Thr Gin. 

10 15 
Tyr Phe Trp Ala Ser Trp Cys Gly 
25 30 
He Gin Ala He Ala Lys Asp Tyr 
45 

Leu Glu Val Asp Pro Asn Pro Ala 
60 

Gly Val Pro Ala Leu Arg Leu Phe 

75 80 
His Glu Gly Ala He Ala Lys Pro 

90 95 
Glu Leu Asp Phe He 
105 



<210> 142 
<211> 108 
<212> PRT 

<213> Thiobacillus ferrooxidans 
<400> 142 

Met Ser Asp Ala He Leu Tyr Val Ser Asp Asp Ser Phe Glu Thr Asp 

1 5 10 15 

Val Leu Lys Ser Ser Lys Pro Val Leu Val Asp Phe Trp Ala Glu Trp 

20 25 30 

Cys Gly Pro Cys Lys Met He Ala Pro He Leu Glu Glu He Ala Asp 

35 40 45 

Glu Tyr Ala Asp Axg Leu Arg Val Ala Lys Phe Asn He Asp Glu Asn 

50 55 60 

Pro Asn Thr Pro Pro Gin Tyr Ala He Arg Gly He Pro Thr Leu Leu 
65 70 75 80 

Leu Phe Lys Ala Gly Lys Leu Glu Ala Thr Lys Val Gly Ala Leu Ser 

85 90 95 

Lys Ala Gin Leu Thr Ala Phe Leu Asp Ser Gin Leu 
100 105 



<210> 143 
<211> 91 
<212> PRT 

<213> Thiocapsa roseopersicina 
<400> 143 

Met Ser Asp Ser He Val His Val Thr Asp Asp Ser Phe Glu Asp Glu 

15 10 15 

Val Leu Lys Ser Leu Glu Pro Val Leu Val Asp Tyr Trp Ala Asp Trp 

20 25 30 

Cys Gly Pro Cys Lys Met He Ala Pro Val Leu Asp Glu He Ala Gly 

35 40 45 

Glu Tyr Ala Gly Arg He Lys Val Ala Lys Leu Asn He Asp Glu Asn 

50 ~ 55 60 

Pro Asn Thr Pro Arg Arg Tyr Gly He Arg Gly He Pro Thr Leu Met 
65 70 75 80 

Leu Ser Arg Gin Ser Glu Val Glu Ala Thr Lys 
85 90 



<210> 144 
<211> 44 
<212> PRT 

<213> Tissierella creatinophila 
<400> 144 

Met He Glu Leu Asp Lys Ser Asn Phe Glu Glu Glu Val Leu Lys Ala 
15 10 15 
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Glu Gly Thr Val Leu Val Asp Phe 
20 

Lys Ala Leu Met Pro His Val His 

. 35 40 



Trp Ser Pro Ser Cys Glu Pro Cys 
25 30 
Asp Phe Glu Glu , 



<210> 145 
<211> 105 
<212> PRT 

<213> Treponema pallidum 



<400> 145 



Met 


Ala 


Leu 


Leu 


Asp lie Ser Ser 


1 








5 


Thr 


Asn 


Pro 


Leu 


Val He Val Asp 








20 


Cys 


Lys 


Met 


Leu 


Gly Pro Val Leu 






35 




40 


Ser 


Gly 


Val 


Val 


He Gly Lys Leu 




50 






55 


Ala 


Val 


Glu 


Phe 


Asn Val Ala Ser 


65 








70 


Asp 


Gly 


Lys 


Glu 


Val Asp Arg Ser 










85 


lie 


Leu 


Thr 


Leu 


He Gin Lys Asn 








100 



Gly Asn Val Arg Lys Thr He Glu 

10 15 
Phe Trp Ala Pro Trp Cys Gly Ser 
25 30 
Glu Glu Val Glu Ser Glu Val Gly 
45 

Asn Val Asp Asp Asp Gin Asp Leu 
60 

He Pro Thr Leu He Val Phe Lys 

75 80 
He Gly Phe Val Asp Lys Ser Lys 
90 95 

Ala 
105 



<210> 146 
<211> 104 
<212> PRT 
<213> Bos taurus 



<400> 146 



Val LyB 


Gin 


He 


Glu Ser Lys Tyr 


1 






5 


Ala Gly 


Glu 


Lys 


Leu Val Val Val 




20 




Pro Cys 


Ly s 


Met 


He Lys Pro Phe 




35 




40 


Ser Asn 


Val 


Val 


Phe Leu Glu Val 


50 






55 


Ala Ala 


Glu 


Cys 


Glu Val Lys Cys 


65 






70 


Lys Gly 


Gin 


Lys 


Val Gly Glu Phe 








85 


Glu Ala 


Thr 


He 


Asn Glu Leu He 






100 





Ala Phe Gin Glu Ala Leu Asn Ser 

10 15 
Asp Phe Ser Ala Thr Trp Cys Gly 
25 30 
Phe His Ser Leu Ser Glu Lys Tyr 
45 

Asp Val Asp Asp Cys Gin Asp Val 
60 

Met Pro Thr Phe Gin Phe Phe Lys 

75 80 
Ser Gly Ala Asn Lys Glu Lys Leu 
90 95 



<210> 147 
<211> 166 
<212> PRT 
<213> Bos taurus 

<400> 147 

Met Ala Gin Arg Leu Leu Leu Arg Arg Phe Leu Thr Ser He He Ser 

1 5 10 15 

Gly Lys Pro Ser Gin Ser Arg Trp Ala Pro Val Ala Ser Arg Ala Leu 

20 25 30 

Lys Thr Pro Gin Tyr Ser Pro Gly Tyr Leu Thr Val Thr Pro Ser Gin 

.35 40 ~ 45 

Ala Arg Ser He Tyr Thr Thr Arg Val Cys Ser Thr Thr Phe Asn He 

50 55 60 

Gin Asp Gly Pro Asp Phe Gin Asp Arg Val Val Asn Ser Glu Thr Pro 
65 70 75 80 

Val Val Val Asp Phe His Ala Gin Trp Cys Gly Pro Cys Lys He Leu 



-86 



WO 02/50289 



PCTAJS01/50240 





85 




Gly Pro Arg Leu 


Glu 


Lys Val Val 


100 






Met Ala Lys Val 


Asp 


He Asp Asp 


115 




120 


Glu Val Ser Ala 


Val 


Pro Thr Val 


130 




135 


Val Asp Lys Phe 


Val 


Gly He Lys 


145 




150 


Leu Lys Lys Leu 


He 
165 


Gly 



90 95 
Ala Lys Gin His Gly Lys Val Val 
105 110 
His Thr Asp Leu Ala Leu Glu Tyr 
125 

Leu Ala Met Lys Asn Gly Asp Val 
140 

Asp Glu Asp Gin Leu Glu Ala Phe 
155 160 



<210> 148 
<211> 115 
<212> PRT 

<213> Caenorhabditis elegans 



<400> 148 










Met 


Leu 


Lys 


Arg 


Cys 


Asn 


Phe Lys 


1 








5 






Asp 


Phe 


Glu 


Gin 
20 


Leu 


He 


Arg Gin 


Asp 


Phe 


Tyr 
35 


Ala 


Thr 


Trp 


Cys Gly 
40 


Tyr 


Lys 
50 


Glu 


Leu 


Ala 


Thr 


Thr His 
55 


Asp 


Val 


Asp 


Glu 


Ala 


Glu 


Asp Leu 


65 










70 




Met 


Pro 


Thr 


Phe 


He 
85 


Phe 


Thr Lys 


Glu 


Gly 


Cys 


Val 
100 


Glu 


Asp 


Glu Leu 


Ser 


Ala 


Gin 
115 











Asn Gin Val Lys Tyr Phe Gin Ser 

10 15 
His Pro Glu Lys He lie He Leu 
25 30 
Pro Cys Lys Ala He Ala Pro Leu 
45 

Lys Gly He He Phe Cys Lys Val 
60 

Cys Ser Lys Tyr Asp Val Lys Met 

75 80 
Asn Gly Asp Ala He Glu Ala Leu 

90 95 
Arg Gin Lys Val Leu Glu His Val 
105 110 



<210> 149 
<211> 20 
<212> PRT 
<213> Canis 



f amiliaris 



<400> 149 

Val Lys Gin He Glu Phe Lys Tyr Ala Phe Gin Glu Ala Leu Asn Ser 

1 .5 10 15 

Ala Gly Asp Lys 
20 



<210> 150 
<211> 104 
<212> PRT 

<213> Gallus gallus 



<400> 150 



Val Lys Ser 


Val 


Gly Asn 


Leu Ala 


1 




5 








Ala Gly Glu 


Lys 
20 


Leu 


Val 


Val 


Val 


Pro Cys Lys 


Met 


He 


Lys 


Pro 


Phe 


35 










40 


Gly Asp Val 


Val 


Phe 


He 


Glu 


lie 


50 








55 




Ala Thr His 


Cys 


Asp 


Val 


Lys 


Cys 


65 






70 






Asn Gly Lys 


Lys 


Val 
85 


Gin 


Glu 


Phe 



Asp Phe Glu Ala Glu Leu' Lys Ala 

10 15 
Asp Phe Ser Ala Thr Trp Cys Gly 
25 30 
Phe His Ser Leu Cys Asp Lys Phe 
45 

Asp Val Asp Asp Ala Gin Asp Val 
60 

Met Pro Thr Phe Gin Phe Tyr Lys 

75 80 
Ser Gly Ala Asn Lys Glu Lys Leu 
90 95 
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Glu Glu Thr lie Lys Ser Leu Val 
100 



<210> 151 
<211> 107 
<212> PRT 

<213> Drosophila melanogaster 



<400> 151 



Met 


Ala 


Ser 


Val 


Arg Thr Met Asn 


1 








5 


Ala 


Asp 


Asp 


Lys 


Leu lie Val Leu 








20 




Pro 


Cys 


Lys 


Glu 


Met Glu Ser Thr 






35 




40 


Ser 


Ser 


Lys 


Ala 


Val Val Leu Lys 




50 






55 


Leu 


Thr 


Glu 


Arg 


Tyr Lys Val Arg 


65 








70 


Arg 


Gin 


Asn 


Arg 


Arg Leu Ala Ser 










85 


Leu 


Thr 


Asn 


Met 


Met Ala Lys Leu 








100 



Asp Tyr His Lys Arg lie Glu Ala 

10 15 
Asp Phe Tyr Ala Thr Trp Cys Gly 
25 30 
Val Lys Ser Leu Ala Arg Lys Tyr 
45 

lie Asp Val Asp Lys Phe Glu Glu 
60 

Ser Met Pro Thr Phe Val Phe Leu 

75 80 
Phe Ala Gly Ala Asp Glu His Lys 

90 95 
Val Lys Ala 
105 



<210> 152 
<211> 104 
<212> PRT 
<213> Homo sapien 



<400> 152 

Val Lys Gin lie Glu Ser Lys Thr Ala Phe Gin Glu Ala Leu Asp Ala 

1 5 - 10 15 

Ala Gly Asp Lys Leu Val Val Val Asp Phe Ser Ala Thr Trp Cys Gly 

20 25 30 

Pro Cys Lys Met lie Lys Pro Phe Phe His Ser Leu Ser Glu Lys Tyr 

35 40 45 

Ser Asn Val lie Phe Leu Glu Val Asp Val Asp Asp Cys Gin Asp Val 

50 55 60 

Ala Ser Glu Cys Glu Val Lys Cys Met Pro Thr Phe Gin Phe Phe Lys 
65 70 * 75 80 

Lys Gly Gin Lys Val Gly Glu Phe Ser Gly Ala Asn Lys Glu Lys Leu 

85 90 95 

Glu Ala Thr lie Asn Glu Leu Val 
100 



<210> 153 
<211> 166 
<212> PRT 
<213> Homo sapien 



<400> 153 






Met Ala 


Gin 


Arg 


Leu Leu Leu Arg 


1 






5 


Arg Lys 


Pro 


Ser 


Gin Gly Gin Trp 






20 




Gin Thr 


Pro 


Gin 


Cys Ser Pro Gly 




35 




40 


Ala Arg 


Thr 


lie 


Tyr Thr Thr Arg 


50 






55 


Gin Asp 


Gly 


Pro 


Asp Phe Gin Asp 


65 






70 


Val Val 


Val 


Asp 


Phe His Ala Gin 






85 


Gly Pro 


Arg 


Leu 


Glu Lys Met Val 



Arg Phe Leu Ala Ser Val lie Ser 

10 ■ 15 

Pro Pro Leu Thr Ser Lys Ala Leu 
25 30 
Gly Leu Thr Val Thr Pro Asn Pro 
45 

lie Ser Leu Thr Thr Phe Asn lie 
60 

Arg Val Val Asn Ser Glu Thr Pro 

75 80 
Trp Cys Gly Pro Cys Lys lie Leu 

90 95 
Ala Lys Gin His Gly Lys Val Val 
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100 105 110 

Met Ala Lys Val Asp He Asp Asp His Thr Asp Leu Ala He Glu Tyr 

115 120 " 125 

Glu Val Ser Ala Val Pro Thr Val Leu Ala Met Lys Asn Gly Asp Val 

130 135 140 

Val Asp Lys Phe Val Gly He Lys Asp Glu Asp Gin Leu Glu Ala Phe 
145 150 155 160 

Leu Lys Lys Leu He Gly 
165 



<210> 154 
<211> 104 
<212> PRT 

<213> Macaca mulatta 



<400> 154 



Val Lys 


Gin 


He 


Glu Ser Lys Ala 


1 






5 


Ala Gly 


Asp 


Lys 


Leu Val Val Val 






20 




Pro Cys 


Lys 


Met 


He Lys Pro Phe 




35 




40 


Ser Asn 


Val 


Val 


Phe Leu Glu Val 


50 






55 


Ala Ser 


Glu 


Cys 


Glu Val Lys Cys 


65 






70 


Lys Gly 


Gin 


Lys 


Val Gly Glu Phe 








85 


Glu Ala 


Thr 


lie 


Asn Glu Leu Val 






100 





Ala Phe Gin Glu Ala Leu Asp Asp 

10 15 
Asp Phe Ser Ala Thr Trp Cys Gly 
25 30 
Phe His Ser Leu Ser Glu Lys Tyr 
45 

Asp Val Asp Asp Cys Gin Asp Val 
60 

Met Pro Thr Phe Gin Phe Phe Lys 

75 80 
Ser Gly Ala Asn Lys Glu Lys Leu 
90 95 



<210> 155 
<211> 104 
<212> PRT 
<213> Mus musculus 



<400> 155 



Val Lys 


Leu 


He 


Glu Ser Lys Glu 


1 






5 


Ala Gly 


Asp 


Lys 


Leu Val Val Val 






20 




Pro Cys 


Lys 


Met 


He Lys Pro Phe 




35 




40 


Ser Asn 


Val 


Val 


Phe Leu Glu Val 


50 






55 


Ala Ala 


Asp 


Cys 


Glu Val Lys Cys 


65 






70 


Lys Gly 


Gin 


Lys 


Val Gly Glu Phe 








85 


Glu Ala 


Ser 


He 


Thr Glu Tyr Ala 






100 



Ala Phe Gin Glu Ala Leu Ala Ala 

10 15 
Asp Phe Ser Ala Thr Trp Cys Gly 
25 30 
Phe His Ser Leu Cys Asp Lys Tyr 
45 

Asp Val Asp Asp Cys Gin Asp Val 
60 

Met Pro Thr Phe Gin Phe Tyr Lys 

75 " 80 

Ser Gly Ala Asn Lys Glu Lys Leu 
90 95 



<210> 156 
<211> 166 
<212> PRT 
<213> Mus musculus 

<400> 156 

Met Ala Gin Arg Leu Leu Leu Gly Arg Phe Leu Thr Ser Val He Ser 

15 10 15 

Arg Lys Pro Pro Gin Gly Val Trp Ala Ser Leu Thr Ser Lys Thr Leu 

20 25 30 

Gin Thr Pro Gin Tyr Asn Ala Gly Gly Leu Thr Val Met Pro Ser Pro 
35 40 45 
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Ala 


Arg 


Thr 


Val 


His 


Thr Thr Arg 




50 








55 


Gin 


Asp 


Gly Pro 


Asp 


Phe Gin Asp 


65 










70 


Val 


Val 


Val Asp 


Phe 


His Ala Gin 










85 




Gly 


Pro 


Arg 


Leu 


Glu 


Lys Met Val 








100 






Met 


Ala 


Lys 


Val 


Asp 


lie Asp Asp 






115 






120 


Glu 


Val 


Ser 


Ala 


Val 


Pro Thr Val 




130 








135 


Val 


Asp 


Lys 


Phe 


Val 


Gly lie Lys 


145 










150 


Leu 


Lys 


Lys 


Leu 


lie 


Gly 










165 





Val Cys Leu Thr Thr Phe Asn Val 
60 

Arg Val Val Asn Ser Glu Thr Pro 

75 80 
Trp Cys Gly Pro Cys Lys lie Leu 

90 95 
Ala Lys Gin His Gly Lys Val Val 
105 110 
His Thr Asp Leu Ala lie Glu Tyr 
125 

Leu Ala lie Lys Asn Gly Asp Val 
140 

Asp Glu Asp Gin Leu Glu Ala Phe 
155 160 



<210> 157 
<211> 33 
<212> PRT 
<213> Sus scrofa 



<400> 157 

Val Lys Gin He Glu Ser Lys Tyr 

1 5 
Ala Gly Glu Lys Leu Val Val Val 
20 

Pro 



Ala Phe Gin Glu Ala Leu Asn Ser 

10 15 
Asp Phe Ser Ala Thr Trp Cys Gly 
25 30 



<210> 158 
<211> 104 
<212> PRT 

<213> Oryctolagus cuni cuius 



<400> 158 



Val Lys 


Gin 


He 


Glu 


Ser 


Lys Ser 


1 






5 




Ala Gly 


Asp 


Lys 


Leu 


Val 


Val Val 






20 








Pro Cys 


Lys 


Met 


He 


Lys 


Pro Phe 




35 








40 


Asn Asn 


Val 


Val 


Phe 


He 


Glu Val 


50 










55 


Ala Ala 


Glu 


Cys 


Glu 


Val 


Lys Cys 


65 








70 




Lys Gly 


Gin 


Lys 


Val 


Gly 


Glu Phe 








85 






Glu Ala 


Thr 


He 


Asn 


Glu 


Leu Leu 



100 



Ala Phe Gin Glu Val Leu Asp Ser 

10 15 
Asp Phe Ser Ala Thr Trp Cys Gly 
25 30 
Phe His Ala Leu Ser Glu Lys Phe 
45 

Asp Val Asp Asp Cys Lys Asp He 
60 

Met Pro Thr Phe Gin Phe Phe Lys 

75 80 
Ser Gly Ala Asn Lys Glu Lys Leu 
90 95 



<210> 159 
<211> 104 
<212> PRT 

<213> Rattus norvegicus 



<400> 159 

Val Lys Leu He Glu Ser Lys Glu Ala Phe Gin Glu Ala Leu Ala Ala 

15 10 15 

Ala Gly Asp Lys Leu Val Val Val Asp Phe Ser Ala Thr Trp Cys Gly 

20 25 30 

Pro Cys Lys Met He Lys Pro Phe Phe His Ser Leu Cys Asp Lys Tyr 

35 40 45 

Ser Asn Val Val Phe Leu Glu Val Asp Val Asp Asp Cys Gin Asp Val . 
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50 55 60 

Ala Ala Asp Cys Glu Val Lys Cys Met Pro Thr Phe Gin Phe Tyr Lys 
65 70 75 80 

Lys Gly Gin Lys Val Gly Glu Phe Ser Gly Ala Asn Lys Glu Lys Leu 

85 90 95 

Glu Ala Thr lie Thr Glu Phe Ala 
100 



<210> 160 
<211> 166 
<212> PRT 

<213> Rattus norvegicus 
<400> 160 



Met Ala 


Gin 


Arg Leu 


Leu Leu Arg 


1 




5 


Arg Lys 


Pro 


Pro Gin 


Gly Val Trp 






20 




Gin Thr 


Pro 


Pro Tyr 


Asn Ala Gly 




35 




40 


Ala Arg 


Thr 


Phe His 


Thr Thr Arg 


50 






55 


Gin Asp 


Gly 


Pro Asp 


Phe Gin Asp 


65 






70 


Val Val 


Val 


Asp Phe 


His Ala Gin 






85 




Gly Pro 


Arg 


Leu Glu 


Lys Met Val 






100 




Met Ala 


Lys 


Val Asp 


lie Asp Asp 




115 




120 


Glu Val 


Ser 


Ala Val 


Pro Thr Val 


130 






135 


Val Asp 


Lys 


Phe Val 


Gly He Lys 


145 






150 


Leu Lys 


Lys 


Leu lie 


Gly 






165 





Arg Phe Leu Thr Ser Val He Ser 

10 * 15 

Ala Ser Leu Thr Ser Thr Ser Leu 
25 30 
Gly Leu Thr Gly Thr Pro Ser Pro 
45 

Val Cys Ser Thr Thr Phe Asn Val 
60 

Arg Val Val Asn Ser Glu Thr Pro 

75 80 
Trp Cys Gly Pro Cys Lys He Leu 

90 95 
Ala Lys Gin His Gly Lys Val Val 
105 110 
His Thr Asp Leu Ala He Glu Tyr 
125 

Leu Ala He Lys Asn Gly Asp Val 
140 

Asp Glu Asp Gin Leu Glu Ala Phe 
155 160 



<210> 161 
<211> 104 
<212> PRT 
<213> Ovis aries 



<400> 161 



Val Lys 


Gin 


lie 


Glu 


Ser Lys 


Tyr 


1 






5 








Ala Gly 


Glu 


Lys 
20 


Leu 


Val 


Val 


Val 


Pro Cys 


Lys 
35 


Met 


He 


Lys 


Pro 


Phe 
40 


Ser Asn 


Val 


Val 


Phe 


Leu 


Glu 


Val 


50 










55 




Ala Ala 


Glu 


Cys 


Glu 


Val 


Lys Cys 


65 








70 






Lys Gly 


Gin 


Lys 


Val 
85 


Ser 


Glu 


Phe 


Glu Ala 


Thr 


He 
100 


Asn 


Glu 


Leu 


He 



Ala Phe Gin Glu Ala Leu Asn Ser 

10 15 
Asp Phe Ser Ala Thr Trp Cys Gly 
25 30 
Phe His Ser Leu Ser Glu Lys Tyr 
45 

Asp Val Asp Asp Cys Gin Asp Val 
60 

Met Pro Thr Phe Gin Phe Phe Lys 

75 80 
Ser Gly Ala Asn Lys Glu' Lys Leu 
90 95 



<210> 162 

<211> 261 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 162 
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Met 
1 


Ala 


Arg 


Leu 


Val 


Phe 


Ser 


Leu Asn 


Asn 


Leu 


Ser 


Pro 


5 

Arg 


Asn 


Leu 


Gin Ser 








20 






25 


Ala 


Pro 


Arg 


Phe 


Arg 


Ala 


Val 


Arg Cys 






35 










40 


Glu 


Thr 


Lys 


Gin 


Glu 


Lys 


Leu 


Val He 




50 








55 




Ala 


Ser 


Lys 


Glu 


Val 


Glu 


Ser 


Ser Ser 


65 










70 






Ser 


Ser 


Ser 


Ser 


Gly 


Phe 


Pro 


Glu Ser 










85 








Arg 


Val 


Ala 


Ala 


Val 


Thr 


Val 


He Ala 






100 








105 


Thr 


Arg 


Leu 


Asp 


Phe 


Gly 


He 


Ser Leu 






115 










120 


Leu 


Pro 


Tyr 


Glu 


Glu 


Ala 


Leu 


Ser Asn 




130 










135 




Phe 


Tyr 


Ala 


Asp 


Trp 


Cys 


Glu 


Val Cys 


145 










150 






Tyr 


Lys 


lie 


Glu 


Gin 


Gin 


Tyr 


Lys Asp 










165 








Asn 


Val 


Asp 


Asn 


Thr 


Lys 


Trp 


Glu Gin 








180 








185 


Glu 


Gly 


He 


Pro 


His 


Phe 


Ala 


Phe Leu 




195 










200 


Gly 


Asn 


Val 


Val 


Gly 


Arg 


Leu 


Pro Arg 




210 










215 




Asn 


Ala 


Leu 


Ala 


Ala 


Gly 


Lys 


Gin Ser 


225 










230 






Gly 


Gin 


Tyr 


Ser 


Ser 


Ser 


Glu 


Ser Arg 










245 








Pro 


Leu 


Ser 


His 


Gly 









260 



<210> 163 
<211> 140 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 163 



Met 


Gly 


Ser Cys 


Val 


Ser Lys 


Gly Lys 


1 






5 






Asn 


Val 


Glu Phe 


Ser Gly Gly 


Asn Val 






20 






25 


Ser 


Trp 


Asp Asp 


Lys 


Leu Ala 


Glu Ala 






35 






40 


Val 


Ala 


Asn Phe 


Ser 


Ala Thr 


Trp Cys 




50 






55 




Pro 


Phe 


Phe He 


Glu 


Leu Ser 


Glu Lys 


65 








70 




Leu 


Val 


Asp Val 


Asp 


Glu Leu 


Ser Asp 








85 






Lys 


Ala 


Thr Pro 


Thr 


Phe Phe 


Phe Leu 






100 






105 


Lys 


Leu 


Val Gly 


Ala Asn Jjys 


Pro Glu 






115 






120 


He 


He 


Asp Ser 


Val 


Pro Glu 


Ser Pro 




130 






135 





<210> 164 
<211> 186 
<212> PRT 

<213> Arabidopsis thaliana 



PCT/US01/50240 



Leu 


Pro 


Ser 


Ser 


His 


Gly 


Phe 


10 










15 




Phe 


Phe 


Val 


Thr 


Gin 


Thr 


Gly 










30 






Lys 


Pro 


Asn 


Pro 


Glu 


Ser 


Ser 






45 








Asp 


Asn 


Gly 


Glu 


Thr 


Ser 


Ser 




60 










Ser 


Val 


Ala 


Asp 


Ser 


Ser 


Ser 




75 










80 


Pro 


Asn 


Lys 


Asp 


He 


Asn 


Arg 


90 










95 




Ala 


Leu 


Ser 


Leu 


Phe 


Val 


Ser 










110 






Lys 


Asp 


Leu 


Thr 


Ala 


Ser 


Ala 








125 








Gly 


Lys 


Pro 


Thr 


Val 


Val 


Glu 




140 










Arg 


Glu 


Leu 


Ala 


Pro 


Asp 


Val 




155 










160 


Lys 


Val 


Asn 


Phe 


Val 


Met 


Leu 


170 










175 




Glu 


Leu 


Asp 


Glu 


Phe 


Gly 


Val 










190 






Asp 


Arg 


Glu 


Gly 


Asn 


Glu 


Glu 








205 








Gin 


Tyr 


Leu 


Val 


Glu 


Asn 


Val 




220 










He 


Pro 


Tyr 


Ala 


Arg 


Ala 


Val 




235 










240 


Lys 


Val 


His 


Gin 


Val 


Thr 


Asp 


250 










255 




Gly 


Asp 


Asp 


Asp 


Ser 


Val 


His 


10 










15 




His 


Leu 


He 


Thr 


Thr 


Lys 


Glu 










30 






Asp 


Arg 


Asp 


Gly 


Lys 


He 


Val 








45 








Gly 


Pro 


Cys 


Lys 


He 


Val 


Ala 






60 










His 


Ser 


Ser 


Leu 


Met 


Phe 


Leu 




75 










80 


Phe 


Ser 


Ser 


Ser 


Trp 


Asp 


He 


90 










95 




Lys 


Asn 


Gly 


Gin 


Gin 


He 


Gly 










110' 






Leu 


Gin 


Lys 


Lys 


Val 


Thr 


Ser 



125 



Gin Arg Pro 
140 
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<400> 164 



Met 


Ser 


Glu 


He 


Val 


Asn 


Leu 


Ser Ser Ser Leu Arg Ser Leu Asn Pro 


1 








5 






10 15 


Lys 


He 


Ser 


Pro 


Leu 


Val 


Pro 


Pro Tyr Arg Gin Thr Ser Ser Ser Phe 








20 








25 30 


Ser 


Arg 


Pro 


Arg 


Asn 


Phe 


Lys 


Tyr His Ser Phe Thr Asp Lys He Cys 






35 










40 45 


Leu 


Ala 


Ala 


Glu 


Arg 


He 


Arg 


Ala Val Asp He Gin Lys Gin Asp Gly 




50 










55 


60 


Gly 


Leu 


Gin 


Glu 


Leu 


Asp 


Asp 


Ser Pro Val Ser Val Glu Leu Gly Pro 


65 










70 




75 80 


He 


Cys 


Gly 


Glu 


Ser 


His 


Phe 


Asp Gin Val Met Glu Asp Ala Gin Lys 










85 






90 95 


Leu 


Gly 


Glu 


Ser 


Val 


Val 


He 


Val Trp Met Ala Ala Trp Cys Arg Lys 








100 








105 110 


Cys 


He 


Tyr 


Leu 


Lys 


Pro 


Lys 


Leu Glu Lys Leu Ala Ala Glu Phe Tyr 






115 










120 125 


Pro 


Arg 


Leu 


Arg 


Phe 


Tyr 


His 


Val Asp Val Asn Ala Val Pro Tyr Arg 




130 










135 


140 


Leu 


Val 


Ser 


Arg 


Ala 


Gly 


Val 


Thr Leu Trp Arg Asp Gly Gin Lys Gin 


145 










150 




155 160 


Ala 


Glu 


Val 


He 


Gly 


Gly 


His 


Lys Ala His Phe Val Val Asn Glu Val 










165 




170 175 


Arg 


Glu 


Met 


He 


Glu 


Asn 


Asp 


Ser He Thr 



180 185 



<210> 165 
<211> 207 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 165 



Met 


Glu 


Asn 


Met 


Ser 


Asn 


Leu 


Thr Ser Lys Phe Leu Leu Asn Pro Leu 


1 








5 






10 15 


Asn 


Val 


His 


Lys 


His 


Cys 


Ala 


Val Ser Asp Glu Asn Gly Asp Arg Lys 








20 








25 30 


Ser 


His 


Val 


Leu 


Lys 


Gin 


Val 


Cys Ser Cys He Cys Cys Cys Asn Arg 






35 










40 45 


Arg 


Asn 


Lys 


Thr 


Gin 


Ala 


Arg 


Ser Gin Lys Gly Ser Tyr Phe He Lys 




50 










55 


60 


Gly 


Lys 


Val 


His 


Pro 


Val 


Ser 


Arg Met Glu Lys Trp Glu Glu Lys He 


65 










70 




75 80 


Thr 


Glu 


Ala 


Asn 


Ser 


His 


Gly 


Lys He He Ma Arg His Asp Leu He 










85 






90 95 


Leu 


Cys 


Asn 


Met 


Glu 


Gin 


Leu 


Val Val Asn Phe Lys Ala Ser Trp Cys 








100 








105 ** 110 


Leu 


Pro 


Ser 


Lys 


Thr 


He 


Leu 


Pro He Tyr Gin Glu Leu Ala Ser Thr 






115 








120 125 


Tyr 


Thr 


Ser 


Met 


lie 


Phe 


Val 


Thr He Asp Val Glu Glu Leu Ala He 


130 










135 


140 


Ser 


Lys 


Leu 


Ser 


Asp 


Leu 


Gly 


Val Lys He Cys Leu He Gin Glu Phe 


145 










150 




155 160 


Ser 


His 


Glu 


Trp 


Asn 


Val 


Asp 


Ala Thr Pro Thr Val Val Phe Leu Lys 










165 






170 175 


Asp 


Gly 


Arg 


Gin 


Met 


Asp 


Lys 


Leu Val Gly Gly Asp Ala Ala Glu Leu 








180 








185 190 


Gin 


Lys 


Lys 


Thr 


Ala 


Ala 


Ala 


Ala Asn Leu Leu Leu Arg Gin Ser 






195 










200 205 



<210> 166 
<211> 175 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 166 
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Met 


Leu 


He 


Pro His 


Ala 


Val 


Ser Phe Ala 


Phe Thr Tyr 


Leu Arg 


Asn 


1 






5 






10 




15 




Ser 


Ala 


Asn 


Pro Asp 


Gin 


Asn 


Arg Glu Val 


He Ser He 


His Ser 


Thr 








20 






25 




30 




Ser 


Glu 


Leu 


Glu Ala 


Lys 


Thr 


Lys Ala Ala 


Lys Lys Ala 


Ser Arg 


Leu 






35 








40 


45 






Leu 


He 


Leu 


Tyr Phe 


Thr 


Ala 


Thr Trp Cys 


Gly Pro Cys 


Arg Tyr 


Met 




50 








55 




60 






Ser 


Pro 


Leu 


Tyr Ser 


Asn 


Leu 


Ala Thr Gin 


His Ser Arg 


Val Val 


Phe 


65 






70 






75 




80 


Leu 


Lys 


Val 


Asp He 


Asp 


Lys 


Ala Asn Asp 


Val Ala Ala 


Ser Trp 


Asn 






85 






90 




95 




He 


Ser 


Ser 


Val Pro 


Thr 


Phe 


Cys Phe He 


Arg Asp Gly 


Lys Glu 


Val 








100 






105 




110 




Asp 


Lys 


Val 


Val Gly 


Ala 


Asp 


Lys Gly Ser 


Leu Glu Gin 


Lys He 


Ala 




115 








120 


125 






Gin 


His 


Ser 


Ser Ser 


Lys 


Ala 


Arg Tyr He 


Pro Val Phe 


He Lys 


Tyr 




130 






135 




140 






His 


Ser 


Asp 


Leu Leu 


Leu 


Leu 


Val Asn Glu 


Glu Thr Pro 


Thr Ser 


Asn 


145 






150 






155 




160 


Gin 


Lys 


Leu 


Lys Thr 


Lys 


Thr 


Gly Asp Trp 


Phe His He 


Asn Leu 








165 






170 




175 





<210> 167 
<211> 132 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 167 



Met 


Arg 


Lys 


Gin 


Glu 


Ser 


Glu 


Gly Ala Asn 


Leu Glu Phe 


Glu Ser 


Lys 


1 




5 






10 




15 




Ser 


Asn 


Asp 


Asn 


Gly 


Asn 


Val 


Lys He Ala 


Pro Asn Asp 


Gin Ser 


Phe 






20 






25 




30 




Leu 


Thr 


He 
35 


Leu 


Asp 


Asp 


He 


Lys Ser Ser 
40 


Lys Ser Pro 
45 


Ala Val 


He 


Asn 


Tyr 


Gly 


Ala 


Ser 


Trp 


Tyr 


Thr Leu Phe 


Ser Val Phe 


Thr He 


Thr 




50 








55 




60 






Leu 


Phe 


Met 


Leu 


He 


Lys 


Cys 


Ser Met Lys 


Cys Leu Asn 


Glu Asn 


Gly 


65 










70 






75 




80 


Phe 


Val 


Leu 


Lys 


Leu 


Ser 


Asp 


He Asp Glu 


Cys Pro Glu 


Thr Thr 


Arg 








85 




90 




95 




His 


He 


Arg 


Tyr 
100 


Thr 


Pro 


Thr 


Phe Gin Phe 
105 


Tyr Arg Asp 


Gly Glu 
110 


Lys 


Val 


Asp 


Glu 


Met 


Phe 


Gly 


Ala 


Gly Glu Gin 


Arg Leu His 


Asp Arg 


Leu 




115 










120 


125 




Trp 


Leu 
130 


His 


Ser 

















<210> 168 
<211> 151 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 168 



Met Ala Ser 


He Ser Leu Ser 


Ser Ser Thr 


Val Pro Ser 


Leu 


Asn 


Ser 


1 


5 


10 






15 




Lys Glu Ser 


Ser Gly Val Ser 


Ala Phe Ala 


Ser Arg Ser 


He 


Ser 


Ala 


20 


25 




30 






Val Lys Phe 


Gin Phe Pro Val 


Arg Arg He 


Glu Ala Lys 


Lys 


Gin 


Thr 


35 




40 


45 








Phe Asp Ser 


Phe Glu Asp Leu 


Leu Val Asn 


Ser Asp Lys 


Pro 


Val 


Leu 


50 


55 




60 








Val Asp Tyr 


Tyr Ala Thr Trp 


Cys Gly Pro 


Cys Gin Phe 


Met 


Val 


Pro 


65 


70 




75 






80 


He Leu Asn 


Glu Val Ser Glu 


Thr Leu Lys 


Asp Lys He 


Gin Val 


Val 
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85 



90 



95 



Lys Xle Asp Thr Glu Lys Tyr Pro Ser lie Ala Asn Lys Tyr Lys lie 

100 105 110 

Glu Ala Leu Pro Thr Phe lie Leu Phe Lys Asp Gly Glu Pro Cys Asp 

115 120 ^ 125 

Arg Phe Glu Gly Ala Leu Thr Ala Lys Gin Leu lie Gin Arg lie Glu 

130 135 140 

Asp Ser Leu Lys Val Lys Pro 



<210> 169 
<211> 236 
<212> PRT 

<213>- Arabidopsis thaliana 
<400> 169 

Met Ala Gly Val Val Arg Leu Thr Thr Thr Ser Val Gin Ala He Arg 

1 5 10 15 

Val Ser Ser Ser Phe Ser Ser Phe Ala Thr Ala Leu Asn Pro Leu Gin 

20 25 30 

Pro Cys Leu Pro Pro Asn Ser Asn Leu Asn Ser Asp Lys Arg Leu Arg 

35 40 45 

Leu Leu Ser Ser Ser Pro Ser Cys Ser Ser Ser His Tyr His Pro Ser 

50 55 60 

Ser Gly Leu Gly Ser His Leu Pro Leu Arg Arg Pro Lys Ser Gin Val 
65 70 75 80 

Val Arg Val Lys Val Asp Glu Asn Val Ala Glu Thr Glu Pro Pro Lys 

85 90 95 

Trp Trp Glu Arg Asn Ala Pro Asn Met Val Asp He His Ser Thr Glu 

100 105 110 

Glu Phe Leu Ser Ala Leu Ser Gly Ala Gly Glu Arg Leu Val He Val 

115 120 125 

Glu Phe Tyr Gly Thr Trp Cys Ala Ser Cys Arg Ala Leu Phe Pro Lys 

130 135 140 

Leu Cys Lys Thr Ala Val Glu His Pro Asp He Val Phe Leu Lys Val 
145 150 ^ 155 160 

Asn Phe Asp Glu Asn Lys Pro Met Cys Lys Ser Leu Asn Val Arg Val 

165 * 170 175 

Leu Pro Phe Phe His Phe Tyr Arg Gly Ala Asp Gly Gin Leu Glu Ser 

180 185 190 

Phe Ser Cys Ser Leu Ala Lys Val Lys Lys Ala He Ser Val Ser Pro 

195 200 205 

Phe Pro Gin Leu Glu Leu Gly He Thr Leu Gin Thr Lys Arg Thr Thr 

210 215 220 

Ser Leu Phe Phe Phe Asp Arg He Tyr Gin He Leu 
225 230 235 



<210> 170 
<211> 131 
<212> PRT 

<213> Hordeum bulbosum 
<400> 170 

Met Gly Gly Cys Val Gly Lys Asp Arg Ser He Val Glu Asp' Lys Leu 

15 10 15 

Asp Phe Lys Gly Gly Asn Val His Val He Thr Thr Lys Glu Asp Trp 

20 25 30 

Asp Gin Lys Val Ala Glu Ala Asn Lys Asp Gly Lys He Val Val Ala 

35 40 45 

Asn Phe Ser Ala Ser Trp Cys Gly Pro Cys Arg Val He Ala Pro Val 

50 55 60 

Tyr Ala Glu Met Ser Lys Thr Tyr Pro Gin Leu Met Phe Leu Thr He 
65 70 75 80 

Asp Val Asp Asp Leu Met Asp Phe Gly Ser Thr Trp Asp He Arg Ala 



145 



150 



85 



90 



95 
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Thr Pro Thr Phe Phe Phe Leu Lys Asn Gly Gin Gin lie Asp Lys Leu 

100 105 110 

Val Gly Ala Asn Lys Pro Glu Leu Glu Lys Lys Val Gin Ala Leu Gly 

115 120 125 

Asp Gly" Ser 
130 



<210> 171 
<211> 131 
<212> PRT 

<213> Lolium perenne 



<400> 171 



Met 


Gly 


Gly 


Cys 


Val Gly Lys Asp 


1 








5 


Asp 


Phe 


Lys 


Gly 


Gly Asn Val His 








20 




Asp 


Gin 


Lys 


Val 


Ala Glu Ala Asn 






35 




40 


Asn 


Phe 


Ser 


Ala 


Ser Trp Cys Gly 




50 






55 


Tyr 


Ala 


Glu 


Met 


Ser Lys Thr Tyr 


65 








70 


Asp 


Val 


Asp 


Asp 


Leu Met Asp Phe 










85 


Thr 


Pro 


Thr 


Phe 


Phe Phe Leu Lys 








100 


Val 


Gly 


Ala 


Asn 


Arg Pro Glu Leu 






115 




120 


Asp 


Gly 


Ser 








130 









Arg Ser lie Val Glu Asp Lys Leu 

10 15 
Val lie Thr Thr Lys Glu Asp Trp 
25 30 
Lys Asp Gly Lys He Val Val Ala 
45 

Pro Cys Arg Val He Ala Pro Val 
60 

Pro Gin Leu Met Phe Leu Thr He 

75 80 
Ser Ser Thr Trp Asp He Arg Ala 

90 95 
Asn Gly Gin Leu He Asp Lys Leu 
105 * 110 
Glu Lys Lys Val Gin Ala He Gly 
125 



<210> 172 
<211> 131 
<212> PRT 
<213> Oryza sativa 

<400> 172 

Met Gly Ser Cys Val Gly Lys Glu Arg Ser Asp Glu Glu Asp Lys He 

1 5 * " 10 * 15 

Asp Phe Lys Gly Gly Asn Val His Val He Ser Asn Lys Glu Asn Trp 

20 25 30 

Asp His Lys He Ala Glu Ala Asn Lys Asp Gly Lys He Val He Ala 

35 40 45 

Asn Phe Ser Ala Ala Trp Cys Gly Pro Cys Arg Val He Ala Pro Val 

50 55 60 

Tyr Ala Glu Met Ser Gin Thr Tyr Pro Gin Phe Met Phe Leu Thr He 
65 70 75' 80 

Asp Val Asp Glu Leu Met Asp Phe Ser Ser Ser Trp Asp He Arg Ala 

85 90 95 

Thr Pro Thr Phe Phe Phe Leu Lys Asn Gly Glu Gin Val Asp Lys Leu 

100 105 - 110 

Val Gly Ala Asn Lys Pro Glu Leu Glu Lys Lys Val Ala Ala Leu Ala 

115 120 125 

Asp Ser Ala 
130 



<210> 173 
<211> 296 
<212> PRT 

<213> Solanum tuberosum 
<400> 173 

Met Ala Thr Leu Thr Asn Phe Leu Leu Lys Pro Ser Pro Asn Leu Ala 
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15 10 15 

Ser lie Thr Lys lie Ser Pro Ser Leu Tyr Ser Asn Phe Pro Phe Glu 

20 25 ^ 30 

Lys Ser Lys Gin Ser lie Phe Lys Asn Leu Lys Thr Asn Lys Pro Leu 

35 40 ^ 45 

Leu lie Thr Lys Ala Thr Ala Ala Pro Asp Val Glu Lys Lys Val Ala 

50 55 60 

Lys Ser Glu Arg Val Gin Lys Val Asn Ser Met Glu Glu Leu Asp Glu 
65 70 75 80 

Ala Leu Lys Lys Ala Lys Asn Arg Leu Val Val Val Glu Phe Ala Gly 

85 90 95 

Lys Asp Ser Glu Arg Ser Lys Asn lie Tyr Pro Phe Met Val Asn Leu 

100 * 105 ** 110 

Ser Lys Thr Cys Asn Asp Val Asp Phe Leu Leu Val lie Gly Asp Glu 

115 * 120 125 

Thr Glu Lys Thr Lys Ala Leu Cys Arg Arg Glu Lys lie Asp Lys Val 

130 135 ~ 140 

Pro His Phe Asn Phe Tyr Lys Ser Met Glu Lys lie His Glu Glu Glu 
145 150 155 160 

Gly lie Gly Pro Asp Leu Leu Ala Gly Asp Val Leu Tyr Tyr Gly Asp 

165 " 170 175 

Ser His Ser Glu Val Val Gin Leu His Ser Arg Glu Asp Val Glu Lys 

180 185 * 190 

Val lie Gin Asp His Lys lie Asp Lys Lys Leu lie Val Leu Asp Val 

195 200 205 

Gly Leu Lys His Cys Gly Pro Cys Val. Lys Val Tyr Pro Thr Val lie 

210 215 220 

Lys Leu Ser Lys Gin Met Ala Asp Thr Val Val Phe Ala Arg Met Asn 
225 230 235 ~ 240 

Gly Asp Glu Asn Asp Ser Cys Met Gin Phe Leu Lys Asp Met Asp Val 

245 250 255 

lie Glu Val Pro Thr Phe Leu Phe lie Arg Asp Gly Glu He Cys Gly 

260 265 270 

Arg Tyr Val Gly Ser Gly Lys Gly Glu Leu He Gly Glu He Leu Arg 

275 280 285 

Tyr Gin Gly Val Arg Val Thr Tyr 
290 295 



<210> 174 
<211> 131 
<212> PRT 

<213> Secale cereale 
<400> 174 

Met Gly Gly Cys Val Gly Lys Gly Arg Ser He Val Glu Glu Lys Leu 

15 10 15 

Asp Phe Lys Gly Gly Asn Val His Val He Thr Thr Lys Glu Asp Trp 

20 25 30 

Asp Gin Lys He Glu Glu Ala Asn Lys Asp Gly Lys He Val Val Ala 

35 40 " ~ " 45 

Asn Phe Ser Ala Ser Trp Cys Gly Pro Cys Arg Val Val Ala Pro Val 

50 ^ 55 - 60 

Tyr Ala Gly Met Ser Lys Thr Tyr Pro Gin Leu Met Phe Leu Thr He 
65 70 75 80 

Asp Val Asp Asp Leu Met Asp Phe Ser Ser Thr Trp Asp lie' Arg Ala 

85 90 95 

Thr Pro Thr Phe Phe Phe Leu Lys Asn Gly Gin Gin He Asp Lys Leu 

100 105 110 

Val Gly Ala Asn Lys Pro Glu Leu Glu Lys Lys Val Gin Ala Leu Gly 

115 120 125 

Asp Gly Ser 
130 



<210> 175 
<211> 119 
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<212> PRT 

<213> Secale cereale 



<400> 175 



Met Gly 


Gly 


Cys 


Val 


Gly 


Lys 


Gly 


1 






5 








Asp Phe 


Lys 


Gly 


Gly 


Asn 


Val 


His 






20 










Asp Gin 


Lys 


He 


Glu 


Glu 


Ala 


Asn 


35 










40 


Asn Phe 


Ser 


Ala 


Ser 


Trp 


Cys 


Gly 


50 










55 




Tyr Ala 


Glu 


Met 


Ser 


Lys 


Thr 


Tyr 


65 








70 






Asp Val 


Asp 


Asp 


Leu 


Met 


Asp 


Phe 








85 








Thr Pro 


Thr 


Phe 


Phe 


Phe 


Leu 


Lys 






100 










Val Gly 


Ala 


Asn 


Lys 


Pro 


Glu 






115 












<210> 176 












<211> 106 












<212> PRT 












<213> Manduca sexta 








<400> 176 












Met Ser 


He 


His 


He 


Lys 


Asp 


Ala 


1 






5 








Glu Ala 


Gly 


Asp 


Lys 


Leu 


Val 


Val 






20 










Gly Pro 


Cys 


Lys 


Met 


He 


Gly 


Pro 




35 










40 


Met Ala 


Asp 


Ser 


He 


Val 


Val 


Val 


50 








55 




Asp He 


Ala 


Ala 


Asp 


Tyr 


Asn 


He 


65 








70 






Val Lys 


Asn 


Ser 


Lys 


Lys 


Leu 


Glu 








85 








Lys Leu 


Lys 


Asn 


Thr 


He 


Leu 


Lys 






100 











<210> 177 

<211> 221 

<212> PRT 

<213> Bradyrhizobium japonicum 



<400> 177 



Met 


Leu 


Asp 


Thr 


LyB 
5 


Pro 


Ser 


Ala 


1 

Ala 


Thr 


Val 


Ala 
20 


Val 


Gly 


Gly 


Leu 


Leu 


Gly 


Leu 


Ser 


Arg 


Ala 


Pro 


Thr 






35 










40 


Val 


Ala 


Thr 


Ala 


Gin 


Lys 


He 


Ala 




50 










55 




Ala 


Leu 


Thr 


Met 


Ala 


Ser 


Ala 


Pro 


65 










70 






Glu 


Asp 


Ala 


Asp 


Gly 


Lys 


Pro 


Lys 










85 








Thr 


Leu 


Leu 


Val 


Asn 


Leu 


Trp 


Ala 








100 










Glu 


Met 


Pro 


Ala 


Leu 


Asp 


Glu 


Leu 






115 










120 


Phe 


Glu 


Val 


Val 


Ala 


He 


Asn 


He 



Arg Ser He Val 


Glu 


Glu 


Lys 


Leu 


10 






15 




Val He Thr Thr 


Lys 


Glu 


Asp 


Trp 


25 




30 






Lys Asp Gly Lys 


He 
45 


Val 


Val 


Ala 


Pro- Cys Arg Val 


He 


Ala 


Pro 


Val 


60 










Pro Gin Leu Met 


Phe 


Leu 


Thr 


He 


75 








80 


Ser Ser Thr Trp 


Asp 


He 


Arg 


Ala 


90 






95 




Asn Gly Gin Gin 


He 


Asp 


Lys 


Leu 


105 




110 







Asp Asp Leu Lys 


Asn 


Arg 


Leu 


Ala 


10 






15 




He Asp Phe Met 


Ala 


Thr 


Trp 


Cys 


25 




30 






Lys Leu Asp Glu 


Met 


Ala 


Ala 


Glu 


45 








Lys Val Asp Val 


Asp 


Glu 


Cys 


Glu 


60 










Asn Ser Met Pro 


Thr 


Phe 


Val 


Phe 


75 








80 


Glu Phe Ser Gly 


Ala 


Asn 


Val 


Asp 


90 






95 





Leu Lys 
105 



Thr Arg Arg He 


Pro 


Leu 


Val 


He 


10 






15 




Ala Gly Phe Ala 


Ala 


Leu 


Tyr 


Gly 


25 




30 






Gly Asp Pro Ala 


Cys 
45 


Arg 


Ala 


Ala 


Pro Leu Ala His 


Gly 


Glu 


Val 


Ala 


60 










Leu Lys Leu Pro 


Asp 


Leu 


Ala 


Phe 


75 








80 


Lys Leu Ser Asp 


Phe 


Arg 


Gly 


Lys 


90 






95 




Thr Trp Cys Val 


Pro 


Cys 


Arg 


Lys 


105 




110 






Gin Gly Lys Leu 


Ser 
125 


Gly 


Pro 


Asn 


Asp Thr Arg Asp 


Pro 


Glu 


Lys 


Pro 
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130 135 140 

Lys Thr Phe Leu Lys Glu Ala Asn Leu Thr Arg Leu Gly Tyr Phe Asn 
145 150 155 160 

Asp Gin Lys Ala Lys Val Phe Gin Asp Leu Lys Ala lie Gly Arg Ala 

165 170 175 

Leu Gly Met Pro Thr Ser Val Leu Val Asp Pro Gin Gly Cys Glu lie 

180 185 190 

Ala Thr lie Ala Gly Pro Ala Glu Trp Ala Ser Glu Asp Ala Leu Lys 

195 200 205 

Leu lie Arg Ala Ala Thr Gly Lys Ala Ala Ala Ala Leu 
210 215 220 



<210> 178 
<211> 167 
<212> PRT 

<213> Haemophilus influenzae 



<400> 178 



Met Lys 


He 


Lys 


Lys Leu 


Leu Lys 


1 






5 




Phe He 


Val 


He 


Thr Ser 


He Leu 






20 






Pro Glu 


Glu 


He 


Asn Lys 


He Thr 




35 




40 


Phe Ser 


Leu 


Glu 


Ser Leu 


Asp Gin 


50 








55 


Trp Gly 


Thr 


Trp 


Cys Gly 


Tyr Cys 


65 






70 




Ser Leu 


Ala 


Lys 


Glu Gly 


Tyr Gin 








85 




Gly Asn 


Glu 


Ala 


Asp Val 


Asn Asp 






100 






Phe Thr 


Thr 


Val 


Asn Asp 


Pro Lys 




115 






120 


He Asn 


Val 


Thr 


Pro Thr 


He Val 


130 








135 


Leu Val 


Thr 


Thr 


Gly Leu 


Thr Ser 


145 






150 




Phe Phe 


Ala 


Glu 


Phe Phe 


Gly 








165 



Asn Gly Leu Ser Leu Phe Leu Thr 

10 15 
Asp Phe Val Arg Arg Pro Val Val 
25 30 
Leu Gin Asp Leu Gin Gly Asn Thr 
45 

Asn Lys Pro Thr Leu Leu Tyr Phe 
60 

Arg Tyr Thr Ser Pro Ala He Asn 

75 80 
Val Val Ser Val Ala Leu Arg Ser 

90 95 
Tyr Leu Ser Lys Asn Asp Tyr His 
105 110 
Gly Glu Phe Ala Glu Arg Trp Gin 
125 

Leu Leu Ser Lys Gly Lys Met Asp 
140 

Tyr Trp Gly Leu Lys Val Arg Leu 
155 160 



<210> 179 
<211> 163 
<212> PRT 

<213> Leishraania major 



<400> 179 



Met 


Leu 


Lys 


Val 


Ser 


Ser 


Lys Glu His Tyr Ala Glu He Lys Lys Lys 


1 








5 




10 15 


Ala 


Glu 


Asp 


Ser 


Leu 


Gly 


Leu Val Val His Phe Ser Ala Thr Trp Cys 








20 




25 30 


Glu 


Pro 


Cys 


Thr 


Ala 


Val 


Asn Glu His Leu Thr Lys Gin Ala Ala Glu 






35 








40 45 


Tyr 


Gly 


Asp 


Asn 


Val 


Val 


Phe Ala Glu Val Asp Cys Gly Glu' Leu Gly 




50 










55 60 


Asp 


Val 


Cys 


Glu 


Ala 


Glu 


Gly Val Glu Ser Val Pro Phe Val Ala Tyr 


65 










70 


75 80 


Phe 


Arg 


Thr 


Pro 


Leu 


Val 


Gly Asp Asp Arg Arg Val Glu Arg Val Ala 










85 




90 95 


Asp 


Val 


Ala 


Gly 


Ala 


Lys 


Phe Asp Gin He Asp Met Asn Thr His Ser 






100 






105 110 


Leu 


Phe 


Gly 


Glu 


Lys 


Gly 


Gly Asn Arg Gly Ser Ala Glu Gly Leu Cys 






115 








120 125 


His 


Ser 


Gly 


Arg 


Leu 


Pro 


Ala Leu Pro His Glu Ala Ala Arg Gly Arg 



130 135 140 
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Asn Val His His Arg His Pro lie Ser Ser Ala Leu Arg Leu Tyr Trp 
145 150 155 160 

Ser Ala Val 



<210> 180 

<211> 275 

<212> PRT 

<213> Mortierella alpina 



<400> 180 



Met 


Val 


Ser 


Asn 


Asn 


Tyr 


lie Asp 


He Thr Ser Glu Asp Asp Phe 


Ala 


1 








5 


10 15 




Gin 


Val 


Phe 


Gin 


Pro 


Ser 


Ser Ser 


Thr Val Tyr Ala Leu Asn Phe 


Trp 








20 








25 30 




Ala 


Ala 


Trp 


Ala 


Pro 


Pro 


Cys Val 


Gin Met Asn Glu Val Phe Glu 


Glu 






35 








40 


45 




Leu 


Ala 


Ala 


Lys 


Asn 


Ala 


Asn Val 


Asn Phe Leu Lys He Glu Ala 


Glu 




50 








55 


60 




Lys 


Phe 


Pro 


Asp 


He 


Ser 


Glu Asp 


Tyr Glu He Ala Ala Val Pro 


Ser 


65 








70 


75 


80 


Phe 


Val 


He 


Val 


Lys 


Glu 


Gly Thr 


Val Val Asp Arg Val Glu Gly 


Ala 










85 






90 95 




Asn 


Ala 


Pro 


Glu 


Leu 


Ala 


Lys Val 


He Ala Lys Tyr Ser Lys Ser 


Thr 








100 






105 110 




Ser 


Ser 


Pro 


Leu 


Pro 


Thr 


Gin Ser 


Ser Thr Met Ala Ala Ala Gly 


His 






115 








120 


125 




Ala 


Ala 


Pro 


Ser 


Val 


Ala 


Pro Pro 


Thr Met Ser Pro Glu Glu Met 


Asn 




130 










135 


140 




Ala 


Arg 


Leu 


Lys 


Glu 


Leu 


Thr Ser 


Ser Ser Ser Val Met Ala Phe 


He 


145 








150 




155 


160 


Lys 


Gly 


Thr 


Pro 


Thr 


Ala 


Pro Arg 


Cys Gin Phe Ser Arg Gin Leu 


Leu 










165 




170 175 




Glu 


He 


Leu 


Thr 


Ala 


Gin 


Asn lie 


Arg Phe Ser Ser Phe Asn He 


Leu 








180 








185 190 




Ala 


Asp 


Asp 


Glu 


Val 


Arg 


Gin Ala 


Met Lys Thr Phe Ser Asp Trp 


Pro 






195 








200 


205 




Thr 


Phe 


Pro 


Gin 


Val 


Tyr 


Val Lys 


Gly Glu Phe Val Gly Gly Leu 


Asp 




210 










215 


220 




Val 


Val 


Lys 


Glu 


Leu 


Val 


Ala Ser 


Gly Glu Phe Gin Ala Leu Val 


Pro 


225 








230 




235 


240 


Ala 


Glu 


Lys 


Asp 


Leu 


Lys 


Thr Arg 


Met Asp Glu Leu He Arg Lys 


Ala 










245 






250 255 




Pro 


Val 


Met 


He 


Phe 


lie 


Lys Gly 


Ser Pro Glu Thr Pro Arg Cys 


Gly 








260 






265 270 




Phe 


Ser 


Lys 


















275 















<210> 181 
<211> 160 
<212> PRT 

<213> Neisseria gonorrhoeae 
<400> 181 

Met Lys Arg 'Leu He Leu Ala Ala 

1 5 
Ala His Thr Ala Ser Gly Asp Glu 
20 

Pro Gin Asn Leu Gin Ser Leu Lys 

35 40 
Trp Ala Thr Trp Cys Gly Pro Cys 

50 55 
Lys Trp Tyr Lys Ala Gin Lys Lys 
65 70 
Ala Leu Asp Thr Ser Asp Asn lie 



lie Ala Leu Ala Ala Thr Phe Gly 

10 15 
Leu Ala Gly Trp Lys Asp Asn Thr 
25 30 
Ala Pro Val Arg He Ala Asn Leu 
45 

Arg Lys Glu Met Pro Ala Met Ser 
60 

Gly Ser Val Asp Met Val Gly He 

75 80 
Gly Asn Phe Leu Lys Gin Thr Pro 
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85 

Val Ser Tyr Pro lie Trp Arg Tyr 
100 

Met Lys Ser Tyr Gly Asn Asn Val 
115 120 
Glu Ala Pro Lys Cys Gly Tyr Arg 

130 135 
Glu Lys Ser Leu Thr Glu Ala Val 
145 150 



90 95 
Thr Gly Ala Asn Ser Arg Ser Phe 
105 110 
Gly Val Leu Pro Phe Thr Val Val 
125 

Gin Thr lie Thr Gly Glu Leu Asn 
140 

Lys Leu Ala His Ser Lys Cys Arg 
155 160 



<210> 182 

<211> 208 

<212> PRT 

<213> Rhizobium loti 



<400> 182 



Met 


Ala 


Gly 


Ala 


Leu 


Ala 


Gly Ala Val Ala Val 


Tyr Val Ser Glu Ser 


1 






5 




10 


15 


Arg 


Ser 


Gly 


Asn 


Asn 


Ala 


Pro Ala Arg Val Ala 


Val Gly Gly Ser Lys 








20 






25 


30 


Asp 


Asp 


Val 


Ala 


Cys 


Ala 


Ala Lys Ser Gly Arg 


Ala Lys Lys lie Ala 






35 








40 


45 


Ala 


Ala 


Ala 


Thr 


Gly 


Glu 


Val Ala Ala Leu Leu 


Pro Ala Asp Pro Pro 




50 








55 


60 


Gin 


Ser 


Met 


Lys 


Ser 


Leu 


Ala Phe Asn Gly Pro 


Asp Gly Lys Pro Met 


65 










70 


75 


80 


Thr 


lie 


Ala 


Asp 


His 


Ala 


Gly Lys Thr Val Leu 


Leu Asn Leu Trp Ala 










85 




90 


95 


Thr 


Trp 


Cys 


Ala 


Pro 


Cys 


Arg Ala Glu Met Pro 


Ala Leu Asn Ala Leu 








100 






105 


110 


Gin 


Lys 


Asp 


Lys 


Gly 


Ser 


Asp Ala Phe Gin Val 


lie Ala Val Asn Val 






115 








120 


125 


Asp 


Ala 


Gly 


Asp 


Asp 


Val 


Lys Pro Lys Lys Phe 


Leu Lys Glu Thr Gly 




130 










135 


140 


Val 


Glu 


Ala 


Leu 


Gly 


Tyr 


Phe Arg Asp Ser Thr 


Val Ala Leu Phe Asn 


145 










150 


155 


160 


Asp 


Leu 


Lys 


Ala 


Arg 


Gly 


Leu Ala Leu Gly Leu 


Pro Val Thr Met Leu 










165 




170 


175 


lie 


Asp 


Ser 


Glu 


Gly 


Cys 


Leu lie Ala His Met 


Asn Gly Pro Ala Glu 








180 






185 


190 


Trp 


Ser 


Gly 


Arg 


Asp 


Ala 


Arg Arg Leu Val Glu 


Thr Ala Leu Gly Ser 






195 








200 


205 



<210> 183 

<211> 176 

<212> PRT 

<213> Rhodobacter capsulatus 



<400> 183 



Met 


Ala 


Lys 


Pro 


Leu 


Met 


Phe 


Leu Pro Leu Leu 


Val Met Ala Gly Phe 


1 






5 






10 


15 


Val 


Gly 


Ala 


Gly 


Tyr 


Phe 


Ala 


Met Gin Gin Asn 


Asp Pro Asn Ala Met 






20 






25 


30 


Pro 


Thr 


Ala 


Leu 


Ala 


Gly 


Lys 


Glu Ala Pro Ala 


Val Arg Leu Glu Pro 






35 








40 


45 


Leu 


Gly 


Ala 


Glu 


Ala 


Pro 


Phe 


Thr Asp Ala Asp 


Leu Arg Asp Gly Lys 




50 










55 




60 


lie 


Lys 


Leu 


Val 


Asn 


Phe 


Trp 


Ala Ser Trp Cys 


Ala Pro Cys Arg Val 


65 










70 




75 


80 


Glu 


His 


Pro 


Asn 


Leu 


lie 


Gly 


Leu Lys Gin Asp 


Gly He Glu He Met 










85 






90 


95 


Gly 


Val 


Asn 


Trp 


Lys 


Asp 


Thr 


Pro Asp Gin Ala 


Gin Gly Phe Leu Ala 








100 






105 


110 


Glu 


Met 


Gly 


Ser 


Pro 


Tyr 


Thr 


Arg Leu Gly Ala 


Asp Pro Gly Asn Lys 






115 










120 


125 
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Met Gly Leu Asp Trp 


Gly 


Val 


Ala 


130 




135 




Asp Gly Ala Gly Arg 


He 


Leu 


Thr 


145 


150 






Asp Val He Thr Lys 


Lys 


He 


Asp 



165 



Gly Val Pro Glu Thr Phe Val Val 
140 

Arg He Ala Gly Pro Leu Thr Glu 
155 160 
Pro Leu Leu Ala Gly Thr Ala Asp 
170 175 



<210> 184 
<211> 105 
<212> PRT 

<213> Syne cho cyst is 



<400> 184 



Met Ala 


Val 


Lys 


Lys 


Gin 


Phe 


Ala 


1 






5 








Ser Pro 


Lys 


Pro 


Val 


Leu 


Val 


Asp 




20 










Cys Gin 


Met 


Met 


Ala 


Pro 


He 


Leu 


35 










40 


Gin Gin 


He 


Gin 


Val 


Val 


Lys 


He 


50 










55 




Ala Thr 


Gin 


Tyr 


Gin 


He 


Gin 


Ser 


65 






70 






Gin Gly 


Gin 


Pro 


Val 


His 


Arg 


Met 






85 








Leu He 


Gin 


Gin 


Leu 


Gin 


Val 


Phe 



100 



Asn 


Phe 


Ala Glu Met Leu Ala Gly 




10 


15 


Phe 


Tyr 


Ala Thr Trp Cys Gly Pro 


25 




30 


Glu 


Gin 


Val Gly Ser His Leu Arg 






45 


Asp 


Thr 


Asp Lys Tyr Pro Ala He 






60 


Leu 


Pro 


Thr Leu Val Leu Phe Lys 






75 80 


Glu 


Gly 


Val Gin Gin Ala Ala Gin 




90 


95 


Val 






105 







<210> 185 

<211> 109 

<212> PRT 

<213> Synechocystis 

<400> 185 



Met Ser 


Leu 


Leu 


Glu 


He 


Thr 


Asp 


Ala 


Glu 


Phe Glu Gin Glu Thr Gin 


1 






5 








10 


15 


Gly Gin 


Thr 


Lys 


Pro 


Val 


Leu 


Val 


Tyr 


Phe 


Trp Ala Ser Trp Cys Gly 






20 










25 




30 


Pro Cys 


Arg 


Leu 


Met 


Ala 


Pro 


Ala 


He 


Gin 


Ala He Ala Lys Asp Tyr 




35 










40 






45 


Gly Asp 


Lys 


Leu 


Lys 


Val 


Leu 


Lys 


Leu 


Glu 


Val Asp Pro Asn Pro Ala 


50 










55 








60 


Ala Val 


Ala 


Gin 


Cys 


Lys 


Val 


Glu 


Gly 


Val 


Pro Ala Leu Arg Leu Phe 


65 






70 










75 80 


Lys Asn 


Asn 


Glu 


Leu 


Val 


Met 


Thr 


His 


Glu 


Gly Ala He Ala Lys Pro 






85 










90 


95 


Lys Leu 


Leu 


Glu 


Leu 


Leu 


Lys 


Glu 


Glu 


Leu 


Asp Phe He 


•* 




100 










105 







<210> 186 
<211> 290 
<212> PRT 

<213> Schizosaccharomyces pombe 
<400> 186 

Met Ser Val He Glu He Arg Ser Tyr Gin His Trp He Ser Thr He 

1 5 10 15 

Pro Lys Ser Gly Tyr Leu Ala Val Asp Cys Tyr Ala Asp Trp Cys Gly 

20 25 30 

Pro Cys Lys Ala He Ser Pro Leu Phe Ser Gin Leu Ala Ser Lys Tyr 

35 40 45 

Ala Ser Pro Lys Phe Val Phe Ala Lys Val Asn Val Asp Glu Gin Arg 

50 55 60 

Gin He Ala Ser Gly Leu Gly Val Lys Ala Met Pro Thr Phe- Val Phe 
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65 70 75 80 



Phe 


Glu 


Asn 


Gly 


Lys 


Gin 


He 


Asp 


Met 


Leu Thr Gly Ala Asn Pro Gin 








85 










90 95 


Ala 


Leu 


Lys 


Glu 


Lys 


Val 


Ala 


Leu 


He 


Ser Ser Lys Ala Thr Gly Thr 






100 








105 


110 


Gly 


Ala 


Leu 


Ala 


Ser 


Ser 


Ser 


Ser 


Ala 


Pro Val Lys Gly Phe Ala Ser 






115 










120 




125 


Leu 


Gin 


Gly 


Cys 


He 


Glu 


Asn 


Pro 


Gin 


Leu Glu Cys Leu Asn Gin Gin 




130 






135 






1 A A 

140 


Asp 


Asp 


His 


Asp 


Leu 


Lys 


Ser 


Ala 


Phe 


Asn Ser Asn Pro Ser Ser Phe 


145 






150 








155 160 


Leu 


Glu 


Ser 


Asp 


Val 


As P 


Glu 


Gin 


Leu 


Met lie Tyr lie Pro Pne Leu 








165 








1 1 A 1 TC 


Glu 


Val 


Val 


Lys 


Val 


His 


Ser 


He 


Ala 


He Tnr Pro val Lys Gly Glu 








180 










185 


iyu 


Thr 


Ser 


Ser 


Ala 


Pro 


Lys 


Thr 


He 


Lys 


Leu Tyr He Asn Gin Pro Asn 






195 








200 




205 


Asn 


Leu 


Ser 


Phe 


Glu 


Asp 


Ala 


Glu 


Ser 


Phe Thr Pro Thr Gin Val He 




210 










215 






220 


Glu 


Asp 


lie 


Val 


Tyr 


Glu 


Gin 


Asp 


Asp 


Gin Pro Thr He He Pro Leu 


225 








230 








235 240 


Arg 


Phe 


Val 


Lys 


Phe 


Gin 


Arg 


Val 


Asn 


Ser Leu Val He Phe He Tyr 






245 








250 255 


Ser 


Asn 


Val 


Gly 


Glu 


Glu 


Glu 


Thr 


Thr 


Lys He Ser Arg Leu Glu. Leu 








260 










265 


i »"7 n 
270 


Phe 


Gly 


Glu 


Pro 


Val 


Gly 


Asp 


Ser 


Ser 


Lys Gly Lys Leu Gin Lys Val 






275 










280 




285 


Glu 


Ala 




















290 


















<210> 187 
















<211> 185 
















<212> PRT 
















<213> Treponema 


pallidum 








<400> 187 
















Met 


Phe 


Arg 


Ser 


Asp 


Leu 


Val 


Leu 


Ala 


Val Trp Gly Val Thr Cys Val 


1 








5 










10 15 


Gin 


Ala 


Ala 


Asp 


Val 


Ala 


His 


Asn 


Ala 


Asp Val Pro Ser Arg Ser Leu 








20 










25 


30 


Lys 


Ala 


Leu 


Glu 


Arg 


Phe 


Arg 


Phe 


Phe 


Val Tyr Pro Lys Pro Leu Asp 






35 










40 




45 


Leu 


Ser 


Ser 


Asp 


Phe 


His 


Ala 


Lys 


Ala 


Leu Lys Gly Glu Ala Leu Val 




50 










55 






60 


Pro 


Ser 


Leu 


Phe 


Lys 


Gly 


Lys 


Val 


Thr 


Leu Leu Asn Phe Trp Ala Thr 


65 










70 








75 80 


Trp 


Cys 


Pro 


Pro 


Cys 


Arg 


Ala 


Glu 


Met 


Pro Ser Met Asp Arg Met Gin 






85 










90 95 


Ala 


Leu 


Met 


Arg 


Gly 


Asn 


Asp 


Phe 


Gin 


He Val Ala Val Asn Val Gly 








100 










105 


1 *1 A 

110 


Asp 


Ser 


Arg 


Lys 


Gin 


Val 


Glu 


Ser 


Phe 


He Ala Arg Gly Lys His Thr 






115 










120 






Phe 


Pro 


lie 


Tyr 


Leu 


Asp 


Glu 


Glu 


Gly 


Ser Leu Gly Ser Val Phe Ala 




130 






135 






140 


Ser 


Arg 


Gly 


Leu 


Pro 


Thr 


Thr 


Tyr 


Val 


Val Asp Lys Ala Gly Arg He 


145 








150 








155 160 


Val 


Ala 


Val 


Val 


Val 


Gly 


Ser 


Val 


Glu 


Tyr Asp Gin Pro Glu Leu Val 










165 










170 175 


Ala 


Leu 


Phe 


Lys 


Glu 


Leu 


Ala 


Arg 


Asp 










180 










185 





<210> 188 
<211> 246 
<212> PRT 

<213> Caenorhabditis elegans 
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<400> 188 



Met 


Leu 


Leu Arg 


Leu 


Leu 


Ala 


val 


1 






5 








Ser 


Gly 


Gly Pro 


Thr 


Arg 


Ser 


Ser 






20 










Asn 


Trp 


Thr Asp 


Leu 


Met 


Lys 


Gly 






35 








40 


Pro 


Trp 


Cys Pro 


Ala 


Cys 


Lys 


Asp 




50 








55 




Ala 


Asp 


Trp Ser 


Asp 


Asp 


Leu 


Gly 


65 








70 






Thr 


Val 


Asn Pro 


Gly 


Leu 


Ser 


Gly 








85 








Thr 


He 


Tyr His 


Val 


Lys 


Asp 


Gly 






100 










Arg 


Asp 


Lys Asn 


Asp 


Phe 


He 


Ser 






115 








120 


Val 


He 


Asp Pro 


Val 


Pro 


Asp 


Tyr 




130 








135 




Ala 


Val 


Val Ala 


Val 


Phe 


Phe 


Lys 


145 








150 






His 


Asn 


His Leu 


Val 


Glu 


Asp 


Lys 








165 








Gly 


Leu 


Phe Ala 


Gly 


Val 


Thr 


Leu 




180 










Phe 


He 


Val He 


He 


He 


Asp 


Gin 






195 








200 


Ser 


Gin 


Gin Ala 


Lys 


Lys 


Thr 


Glu 




210 








215 




Gly 


Thr 


Glu Ser 


Pro 


Thr 


Lys 


Lys 


225 








230 






Glu 


Thr 


Lys Lys 


Thr 


Lys 












245 









<210> 189 

<211> 284 

<212> PRT 

<213> Caenorhabditis elegans 



<400> 189 



Met 


Pro 


Val 


He 


Asn 


Val 


Lys 


Asp 


1 








5 








Ser 


Leu 


Ala 


Gly 
20 


Leu 


Lys 


Ser 


Val 


Cys 


Gly 


Pro 
35 


Cys 


Lys 


Met 


He 


Ala 
40 


Gin 


Tyr 
50 


Leu 


Gly 


Ala 


Val 


Phe 
55 


Leu 


Lys 


Thr 


Ser 


Ser 


Glu 


Asn 


Gly 


Val 


65 










70 






Phe 


Gin 


Ser 


Gly 


Val 
85 


Arg 


Val 


Glu 


Ala 


Leu 


Glu 


Thr 
100 


Met 


Val 


Lys 


Lys 


Ser 


Leu 


Val 


Ala 


Gly 


Gin 


Met 


Asp 






115 








120 


Gin 


Met 


Glu 


Cys 


Leu 


Asn 


Glu 


Ser 




130 








135 




Leu 


Glu 


Gly 


Asn 


Cys 


Asn 


Leu 


Val 


145 






150 






He 


Ser 


Leu 


Pro 


Phe 
165 


Asn 


Gin 


Pro 


Lys 


Gly 


Val 


Ser 
180 


Asp 


Arg 


Ala 


Pro 


Leu 


Pro 


Lys 


Thr 


Thr 


Asp 


Phe Asp 






195 










200 



Leu 


Gly Leu 


Phe Ala Val Gly Val 




10 


15 


Lys 


Leu Val 


Phe Leu Asn Glu Glu 


25 




30 


Glu 


Trp Met 


He Glu Phe His Ala 




45 


Leu 


Gin Lys 


Ala Trp Asn Ala Phe 






60 


He 


Lys Val 


Gly Glu Val Asp Val 




75 


80 


Arg 


Phe Leu 


Val Thr Ala Leu Pro 


90 


95 


Val 


Phe Arg 


Gin Tyr Ser Gly Ala 


105 




110 


Phe 


Val Glu 


Asp Lys Lys Tyr Arg 






125 


Lys 


His Pro 


Asn Ser Lys Gin Met 




140 


Leu 


Ser Met 


Ser Val Arg Asp Leu 




155 


160 


Gly 


He Pro 


Ser Trp Ala Ser Tyr 




170 


175 


Ala 


Leu Gly 


Cys Val Leu Gly Phe 


185 




190 


Val 


Phe Pro 


Thr Gly Pro Arg Lys 






205 


Lys 


Lys Asp 


Ala Lys Lys Asp Ser 






220 


Asn 


Gly Asn 


Asn Asn Asn Gly Lys 




235 


240 


Asp 


Glu Asp 


Phe Arg Asn Gin Leu 




10 


15 


He 


Val Asp 


Phe Thr Ala Val Trp 


25 , 




30 


Pro 


Thr Phe 


Glu Ala Leu Ser Asn 






45 


Lys 


Val Asp 


Val Glu He Cys Glu 






60 


Asn 


Ser Met 


Pro Thr Phe Met Val 




75 


80 


Gin 


Met Lys 


Gly Ala Asp Ala Lys 




90 


95 


Tyr 


Ala Asp 


Asn Ser Ala Ala Asp 


105 




110 


Leu 


Thr Pro 


Leu Val Asp Lys Lys 






125 


Asp 


Asp Thr 


Pro Leu Gly Arg Phe 






140 


Ser 


Asp Cys 


Asp Glu Gin Leu He 




155 


160 


Val 


Lys Val 


His Ser He Leu He 




170 


175 


Lys 


Lys Val 


Lys Val Phe He Asn 


185 




190 


Asn 


Ala Thr 


Ala Leu Glu Pro Thr 



205 
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Gin 


Met Leu 


Glu 


Phe Asp Glu Ser 


Ser 




210 




215 




Val 


Ala Leu 


Lys 


Tyr Val Lys Phe 


Gin 


225 






230 




Phe 


lie Glu 


Asn 


Asn Val Gly Gly 


Gly 








245 




Leu 


Thr Val 


Phe 


Gly Thr Pro Leu 


Ser 






260 


265 


Lys Arg Val 


Ala 


Gly Lys Ala Gly 


Asp 




275 




280 





<210> 190 
<211> 287 
<212> PRT 

<213> Drosophila melanogaster 
<400> 190 



Met 


Ser 


Val 


Arg 


Val 


He Asn Asp 


Glu 


1 








5 






Ala 


Gin 


Ala 


Gly 


He 


Gin Leu Val 


Val 








20 






25 


Cys 


Gly 


Pro 


Cys 


Lys 


Arg He Ala 


Pro 






35 






40 




Lys 


Tyr 


Pro 


Lys 


Ala 


He Phe Leu 


Lys 




50 








55 




Asp 


Thr 


Ala 


Ala 


Gly 


Gin Gly Val 


Ser 


65 










70 




Tyr 


Arg 


Asn 


Arg 


Thr 


Lys He Asp 


Arg 










85 






Gly 


Leu 


Glu 


Ala 


Lys 


He Gin Glu 


His 






100 






105 


Glu 


Gly 


Gly 


Glu 


Asp 


Tyr Gly Gin 


Gly 






115 






120 




lie 


Ser 


Lys 


Gin 


Glu 


Cys Glu Cys 


Leu 




130 








135 




Leu 


Lys 


His 


Ala 


Leu 


Ala Ser Ala 


Gly 


145 










150 




Asp 


Glu 


Gin 


Leu 


He 


Leu Ser He 


Thr 








165 






His 


Ser 


Leu 


Lys 


Phe 


Lys Ala Pro 


Ser 








180 




185 


Lys 


Leu 


Phe 


lie 


Asn 


Gin Pro Arg 


Thr 




195 






200 




Ser 


Met 


Asn 


Ser 


Val 


Gin Asp Leu 


Ser 




210 








215 




Ser 


Gly 


Val 


Pro 


Val 


Asn Leu Arg 


Tyr 


225 










230 




As*n 


He 


Gin 


He 


Phe 


Val Lys Asn 


Asn 










245 




Gin 


He 


Asp 


Tyr 


He 


Gly Phe He 


Gly 








260 






265 


Met 


Asn 


Asp 


Phe 


Lys 


Arg Val Ala 


Gly 






275 






280 





<210> 191 
<211> 289 
<212> PRT 
• <213> Homo sapien 

<400> 191 

Met Val Gly Val Lys Pro Val Gly Ser 

1 5 

Leu Ser Gly Ala Gly Ser Arg Leu Ala 

20 25 

Gly Cys Gly Pro Cys Leu Arg He Ala 
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He 


Gin 


Gly 


His 


Gly Gin Val 






220 








Asn 


Val 


Gin 


Asn 


He 


Gin Phe 




235 








240 


Asp 


Val 


Thr 


Glu 


Leu 


Val Lys 


250 










255 


Ala 


Leu 


Asn 


Met 


Asn 


Glu Phe 










270 




Ala 


Ala 


His 








Ser 


His 


Phe 


Gin 


Ala 


Glu Leu 


10 










15 


Val 


Asp 


Phe 


Thr 


Ala 


Ser Trp 








30 




He 


Phe 


Glu 


Thr 


Phe 


Pro Thr 








45 






Val 


Asp 


Val 


Asp 


Lys 


Cys Gin 






60 








Ala 


Met 


Pro 


Thr 


Phe 


He Phe 




75 








80 


Val 


Gin 


Gly 


Ala 


Asp 


Val Asn 


90 










95 


He 


Gly 


Thr 


Ser 


Gly Gly Glu 










110 




Leu 


Met 


Glu 


Leu 


Asn 


Thr Phe 








125 






Asn 


Glu 


Ala 


Asp 


Asp 


His Asn 






140 








Gly 


Tyr 


Leu 


Gin 


Ser Asp Cys 




155 








160 


Phe 


Asn 


Gin 


Ala 


Val 


Lys He 


170 










175 


His 


Leu 


Gly 


Pro 


Lys Asp Val 










190 




He 


Asp 


Phe 


Asp 


Met 


Ala Glu 






205 






Leu 


Ala 


Gin 


Lys 


Glu 


Leu Glu 






220 






Val , 


,Lys 


Phe 


Gin 


Asn 


Val Gin 




235 








240 


Gin 


Ser 


Gly 


Gly 


Asp 


Val Thr 


250 










255 


Ser 


Pro 


He 


Met 


Thr Thr Lys 










270 




Lys 


Lys 


Gly 


Glu 


Ser 


His 








285 






Asp 


Pro 


Asp 


Phe 


Gin 


Pro Glu 


10 










15 


Val 


Val 


Lys 


Phe 


Thr Met Arg 










30 




Pro 


Ala 


Phe 


Ser 


Ser 


Met Ser 
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35 






40 


Asn 


Lys 


Tyr 


Pro 


Gin 


Ala Val Phe 




50 








55 


Gin 


Gly 


Thr 


Ala 


Ala 


Thr Asn Asn 


65 








70 


Phe 


Phe 


Arg 


Asn 


Lys 


Val Arg lie 










85 




Val 


Gly 


Leu 


Glu 


Glu 


Lys lie Lys 








100 






Ser 


Asn 


Glu 


Asp 


Thr 


Asp lie Pro 






115 






120 


Phe 


lie 


Asn 


Lys 


Ala 


Gly Cys Glu 




130 








135 


Gly 


Phe 


Asp 


Asn 


Cys 


Leu Arg Lys 


145 










150 


Cys 


Asp 


Glu 


Gin 


Leu 


Leu He Thr 










165 




Leu 


Tyr 


Ser 


Met 


Lys 


Phe Gin Gly 








180 






Tyr 


Val 


Lys 


lie 


Phe 


He Asn Leu 




195 






200 


Ala 


Glu 


Arg 


Ser 


Glu 


Pro Thr Gin 




210 






215 


lie 


Lys 


Glu 


Asp 


Gly 


He Val Pro 


225 










230 


Val 


Asn 


Ser 


Val 


Thr 


He Phe Val 










245 




Thr 


Thr 


Arg 


lie 


Ser 


Tyr Phe Thr 








260 






Thr 


Asn 


Met 


Asn 


Asp 


Phe Lys Arg 






275 






280 


His 













Leu 


Glu 


Val 


Asp 


Val 


His Gin Cys 








60 






He 


Ser 


Ala 


Thr 


Pro 


Thr Phe Leu 






75 






80 


Asp 


Gin 


Tyr 


Gin 


Gly 


Ala Asp Ala 




so 








95 


Gin 


His 


Leu 


Glu 


Asn 


Asp Pro Gly 


105 










110 


Lys 


Gly 


Tyr 


Met 


Asp 


Leu Met Pro 










125 




Cys 


Leu 


Asn 


Glu 


Ser 


Asp Glu His 








140 






Asp 


Thr 


Thr 


Phe 


Leu 


Glu Ser Asp 






155 






160 


Val 


Ala 


Phe 


Asn 


Gin 


Pro Val Lys 




170 








175 


Pro 


Asp 


Asn 


Gly 


Gin 


Gly Pro Lys 


185 










190 


Pro 


Arg 


Ser 


Met 


Asp 


Phe Glu Glu 










205 




Ala 


Leu 


Glu 


Leu 


Thr 


Glu Asp Asp 








220 






Leu 


Arg 


Tyr 


Val 


Lys 


Phe Gin Asn 






235 






240 


Gin 


Ser 


Asn 


Gin 


Gly 


Glu Glu Glu 




250 






255 


Phe 


He 


Gly 


Thr 


Pro 


Val Gin Ala 


265 








270 


Val 


Val 


Gly 


Lys 


Lys 


Gly Glu Ser 



285 



<210> 192 
<211> 335 
<212> PRT 
<213> Homo sapien 



<400> 192 



Met 


Glu 


Ala 


Gly 


Ala 


Ala Glu Ala 


1 






5 




Gly 


Ser 


Ala 


Gly 


Gin 


Phe Glu Glu 








20. 






Leu 


Leu 


Val 


Val 


His 


Phe Trp Ala 






35 






40 


Met 


Asn 


Glu 


Val 


Met 


Ala Glu Leu 




50 








55 


Phe 


Val 


Lys 


Leu 


Glu 


Ala Glu Gly 


65 










70 


Glu 


He 


Ser 


Ser 


Val 


Pro Thr Phe 










85 




He 


Asp 


Arg 


Leu 


Asp 


Gly Ala His 








100 






Gin 


Arg 


His 


Ala 


Ser 


Ser Gly Ser 






115 






120 


Leu 


Lys 


Glu 


Asp 


Leu 


Asn Leu Arg 




130 








135 


Pro 


Cys 


Met 


Leu 


Phe 


Met Lys Gly 


145 










150 


Phe 


Ser 


Lys 


Gin 


Met 


Val Glu He 










165 




Ser 


Ser 


Phe 


Asp 


He 


Phe Ser Asp 








180 




Ala 


Tyr 


Ser 


Ser 


Trp 


Pro Thr Tyr 






195 






200 



Ala 


Val 


Ala 


Ala 


Val 


Glu Glu Val 




10 








15 


Leu 


Leu 


Arg 


Leu 


Lys 


Ala Lys Ser 


25 










30 


Pro 


Trp 


Ala 


Pro 


Gin 


Cys Ala Gin 










45 




Ala 


Lys 


Glu 


Leu 


Pro 


Gin Val Ser 








60 






Val 


Pro 


Glu 


Val 


Ser 


Glu Lys Tyr 






75 






80 


Leu 


Phe 


Phe 


Lys 


Asn 


Ser Gin Lys 




90 








95 


Ala 


Pro 


Glu 


Leu 


Thr 


Lys Lys Val 


105 










110 


Phe 


Leu 


Pro 


Ser 


Ala 


Asn' Glu His 










125 




Leu 


Lys 


Lys 


Leu 


Thr 


His Ala Ala 




140 






Thr 


Pro 


Gin 


Glu 


Pro 


Arg Cys Gly 






155 






160 


Leu 


His 


Lys 


His 


Asn 


He Gin Phe 




170 








175 


Glu 


Glu 


Val 


Arg 


Gin 


Gly Leu Lys 


185 










190 


Pro 


Gin 


Leu 


Tyr 


Val 


Ser Gly Glu 










205 
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Leu 


He 


Gly Gly 


Leu 


Asp 


He 


lie 




210 






215 




Leu 


Asp 


Thr He 


Cys 


Pro 


Lys 


Ala 


225 








230 






Val 


Leu 


Thr Asn 


Lys 
245 


Ala 


Ser 


Val 


Gin 


Glu 


Ala Lys 
260 


Cys 


Gly 


Phe 


Ser 


Ser 


Thr 


Gly Val 


Glu 


Tyr 


Glu 


Thr 






275 






280 


Val 


Arg 
290 


Gin Gly 


Leu 


Lys 


Ala 
295 


Tyr 


Leu 


Tyr 


Val Lys 


Gly 


Glu 


Leu 


Val 


305 








310 






Leu 


Lys 


Glu Asn 


Gly 
325 


Glu 


Leu 


Leu 



Lys Glu Leu Glu Ala Ser Glu Glu 
220 

Pro Lys Leu Glu Glu Arg Leu Lys 
235 240 
Met Leu Phe Met Lys Gly Asn Lys 

250 255 
Lys Gin He Leu Glu He Leu Asn 
265 270 
Phe Asp He Leu Glu Asp Glu Glu 
285 

Ser Asn Trp Pro Thr Tyr Pro Gin 
300 

Gly Gly Leu Asp He Val Lys Glu 
315 320 
Pro He Leu Arg Gly Glu Asn 
330 335 



<210> 193 
<211> 131 
<212> PRT 

<213> Phalaris coerulescens 
<400> 193 



Met 


Gly 


Gly 


Cys 


Val 


Gly 


Lys 


Asp Arg Gly He Val Glu Asp Lys Leu 


1 








5 






10 15 


Asp 


Phe 


Lys 


Gly 
20 


Gly 


Asn 


Val 


His Val He Thr Thr Lys Glu Asp Trp 
25 " 30 


Asp 


Gin 


Lys 
35 


He 


Ala 


Glu 


Ala 


Asn Lys Asp Gly Lys lie Val Val Ala 
40 45 


Asn 


Phe 
50 


Ser 


Ala 


Ser 


Trp 


Cys 
55 


Gly Pro Cys Arg Val He Ala Pro Val 
60 


Tyr 


Ala 


Glu 


Met 


Ser 


Lys 


Thr 


Tyr Pro Gin Leu Met Phe Leu Thr He 


65 










70 




75 80 


Asp 


Val 


Asp 


Asp 


Leu 
85 


Val 


Asp 


Phe Ser Ser Thr Trp Asp He Arg Ala 
90 95 


Thr 


Pro 


Thr 


Phe 
100 


Phe 


Phe 


Leu 


Lys Asn Gly Gin Gin He Asp Lys Leu 
105 110 


Val 


Gly 


Ala 
115 


Asn 


Lys 


Pro 


Glu 


Leu Glu Lys Lys Val Gin Ala Leu Gly 
120 125 


Asp 


Gly 
130 


Ser 













<210> 194 
<211> 144 
<212> PRT 

<213> Trypanosoma brucei brucei 



<400> 194 



Met 


Ser 


Gly 


Leu 


Ala 


Lys 


Tyr 


Leu Pro Gly Ala Thr Asn Leu Leu Ser 


1 








5 




10 15 


Lys 


Ser 


Gly 


Glu 


Val 


Ser 


Leu 


Gly Ser Leu Val Gly Lys Thr Val Phe 








20 








25 30 


Leu 


Tyr 


Phe 


Ser 


Ala 


Ser 


Trp 


Cys Pro Pro Cys Arg Gly Phe Thr Pro 






35 










40 45 


Val 


Leu 


Ala 


Glu 


Phe 


Tyr 


Glu 


Lys His His Val Ala Lys Asn Phe Glu 




50 










55 


60 


Val 


Val 


Leu 


He 


Ser 


Trp 


Asp 


Glu Asn Glu Ser Asp Phe His Asp Tyr 


65 










70 




75 80 


Tyr 


Gly 


Lys 


Met 


Pro 


Trp 


Leu 


Ala Leu Pro Phe Asp Gin Arg Ser Thr 










85 






90 95 


Val 


Ser 


Glu 


Leu 


Gly 


Lys 


Thr 


Phe Gly Val Glu Ser He Pro Thr Leu 








100 








105 110 


He 


Thr 


He 


Asn 


Ala 


Asp 


Thr 


Gly Ala He He Gly Thr Gin Ala Arg 






115 










120 125 


Thr 


Arg 


Val 


He 


Glu 


Asp 


Pro 


Asp Gly Ala Asn Phe Pro Trp Pro Asn 
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130 135 140 



<210> 195 
<211> 333 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 195 

Met Asn Gly Leu Glu Thr His Asn Thr Arg Leu Cys lie Val Giy Ser 

15 10 15 

Gly Pro Ala Ala His Thr Ala Ala He Tyr Ala Ala Arg Ala Glu Leu 

20 25 30 

Lys Pro Leu Leu Phe Glu Gly Trp Met Ala Asn Asp He Ala Pro Gly 

35 40 45 

Gly Gin Leu Thr Thr Thr Thr Asp Val Glu Asn Phe Pro Gly Phe Pro 

50 55 60 

Glu Gly He Leu Gly Val Glu Leu Thr Asp Lys Phe Arg Lys Gin Ser 
65 70 75 80 

Glu Arg Phe Gly Thr Thr He Phe Thr Glu Thr Val Thr Lys Val Asp 

85 90 95 

Phe Ser Ser Lys Pro Phe Lys Leu Phe Thr Asp Ser Lys Ala He Leu 

100 " 105 110 

Ala Asp Ala Val He Leu Ala Thr Gly Ala Val Ala Lys Arg Leu Ser 

115 120 125 

Phe Val Gly Ser Gly Glu Ala Ser Gly Gly Phe Trp Asn Arg Gly He 

130 135 " 140 

Ser Ala Cys Ala Val Cys Asp Gly Ala Ala Pro He Phe Arg Asn Lys 
145 150 155 160 

Pro Leu Ala Val He Gly Gly Gly Asp Ser Ala Met Glu Glu Ala Asn 

165 170 175 

Phe Leu Thr Lys Tyr Gly Ser Lys Val Tyr He He His Arg Arg Asp 

180 185 190 

Ala Phe Arg Ala Ser Lys He Met Gin Gin Arg Ala Leu Ser Asn Pro 

195 200 205 

Lys He Asp Val He Trp Asn Ser Ser Val Val Glu Ala Tyr Gly Asp 

210 215 220 

Gly Glu Arg Asp Val Leu Gly Gly Leu Lys Val Lys Asn Val Val Thr 
225 230 235 240 

Gly Asp Val Ser Asp Leu Lys Val Ser Gly Leu Phe Phe Ala He Gly 

245 250 255 

His Glu Pro Ala Thr Lys Phe Leu Asp Gly Gly Val Glu Leu Asp Ser 

260 265 " " 270 

Asp Gly Tyr Val Val Thr Lys Pro Gly Thr Thr Gin Thr Ser Val Pro 

275 280 285 

Gly Val Phe Ala Ala Gly Asp Val Gin Asp Lys Lys Tyr Arg Gin Ala 

290 295 300 

He Thr Ala Ala Gly Thr Gly Cys Met Ala Ala Leu Asp Ala Glu His 
305 310 315 320 

Tyr Leu Gin Glu He Gly Ser Gin Gin Gly Lys Ser Asp 
325 330 



<210> 196 
<211> 383 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 196 

Met Cys Trp He Ser Met Ser Gin Ser Arg Phe He He Lys Ser Leu 

15 10 15 

Phe Ser Thr Ala Gly Gly Phe Leu Leu Gly Ser Ala Leu Ser Asn Pro 

20 25 30 

Pro Ser Leu Ala Thr Ala Phe Ser Ser Ser Ser Ser Ser Ser Ser Ala 

35 40 45 

Ala Ala Ala Val Asp Met Glu Thr His Lys Thr Lys Val Cys He Val 
50 55 60 
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Gly 


Ser 


Gly Pro Ala Ala 


His 


Thr 


Ala 


Ala 


He 


Tyr 


Ala Ser Arg 


Ala 


65 




70 










75 








80 


Glu 


Leu 


Lys Pro Leu Leu 


Phe 


Glu 


Gly 


Trp 


Met 


Ala 


Asn Asp 


He 


Ala 






85 








90 








95 




Pro 


Gly 


Gly Gin Leu Thr 


Thr 


Thr 


Thr 


Asp 


Val 


Glu 


Asn Phe 


Pro 


Gly 




100 






105 








110 






Phe 


Pro 


Glu Gly lie Leu 


Gly 


He 


Asp 


He 


Val 


Glu 


Lys Phe 


Arg 


Lys 






115 




120 










125 






Gin 


Ser 


Glu Arg Phe Gly 


Thr 


Thr 


He 


Phe 


Thr 


Glu 


Thr Val 


Asn 


Lys 




130 


135 










140 








Val 


Asp 


Phe Ser Ser Lys 


Pro 


Phe 


Lys 


Leu 


Phe 


Thr 


Asp Ser Arg 


Thr 


145 


150 










155 








160 


Val 


Leu 


Ala Asp Ser Val 


He 


He 


Ser 


Thr 


Gly 


Ala 


Val Ala 


Lys 


Arg' 






165 








170 








175 




Leu 


Ser 


Phe Thr Gly Ser 


Gly 


Glu 


Gly 


Asn 


Gly 


Gly 


Phe Trp Asn 


Arg 






' 180 






185 








190 






Gly 


lie 


Ser Ala Cys Ala 


Val 


Cys 


Asp 


Gly 


Ala 


Ala 


Pro He 


Phe 


Arg 




195 




200 










205 






Asn 


Lys 


Pro Leu Val Val 


He 


Gly 


Gly 


Gly 


Asp 


Ser 


Ala Met 


Glu 


Glu 




210 




215 










220 








Ala 


Asn 


Phe Leu Thr Lys 


Tyr 


Gly 


Ser 


Lys 


Val 


Tyr 


He He 


His 


Arg 


225 




230 










235 








240 


Arg 


Asp 


Thr Phe Arg Ala 


Ser 


Lys 


He 


Met 


Gin 


Gin 


Arg Ala 


Leu 


Ser 


245 








250 








255 




Asn 


Pro 


Lys lie Glu Val 


He 


Trp 


Asn 


Ser 


Ala 


Val 


Val Glu 


Ala 


Tyr 






260 






265 








270 






Gly 


Asp 


Glu Asn Gly Arg 


Val 


Leu 


Gly 


Gly 


Leu 


Lys 


Val Lys 


Asn 


Val 






275 




280 










285 






Val 


Thr 


Gly Asp Val Ser 


Asp 


Leu 


Lys 


Val 


Ser 


Gly 


Leu Phe 


Phe 


Ala 




290 


295 










300 








lie 


Gly 


His Glu Pro Ala 


Thr 


Lys 


Phe 


Leu 


Asp 


Gly 


Gin Leu 


Glu 


Leu 


305 


310 










315 








320 


Asp 


Glu 


Asp Gly Tyr Val 


Val 


Thr 


Lys 


Pro 


Gly 


Thr 


Thr Lys 


Thr 


Ser 




325 








330 








335 




Val 


Val 


Gly Val Phe Ala 


Ala 


Gly 


Asp 


Val 


Gin 


Asp 


Lys Lys Tyr 


Arg 






340 






345 








350 






Gin 


Ala 


lie Thr Ala Ala 


Gly 


Thr 


Gly 


Cys 


Met 


Ala 


Ala' Leu Asp 


Ala 






355 


360 






365 






Glu 


His 


Tyr Leu Gin Glu 


He 


Gly 


Ser 


Gin 


Glu 


Gly 


Lys Ser Asp 






370 


375 










380 









<210> 197 
<211> 323 
<212> PRT 

<213> Aquifex aeolicus 



<400> 197 



Met 


Ala 


Val 


Ser 


Leu 


Met 


Gin 


Gin 


Pro 


Asp 


Lys 


Val 


Tyr 


Asp Val 


He 


1 








5 










10 








15 




He 


lie 


Gly 


Ala 


Gly 


Pro 


Ala 


Gly 


Thr 


Thr 


Ala 


Ala 


He 


Tyr Thr 


Ala 






20 








25 










30 




Arg 


Ala 


Gly 


Trp 


Lys 


Thr 


Leu 


Val 


Leu 


Tyr 


Arg 


Ala 


Glu 


Ala Asp 


Gly 






35 










40 










45 






Ala 


Leu 


Gly 


Val 


Thr 


Gin 


Lys 


He 


Glu 


Asn 


Tyr 


Pro 


Gly 


Val Pro 


Gly 




50 








55 










60 








Pro 


Leu 


Ser 


Gly 


Tyr 


Glu 


Leu 


Leu 


Lys 


He 


Met 


Arg 


Glu 


Gin Ala 


Lys 


65 






70 








75 








80 


Ser 


Phe 


Gly 


Ala 


Glu 


Phe 


Val 


Arg 


Gly 


Lys 


Val 


He 


Ala 


Thr Asp 


Leu 










85 










90 








95 




Asn 


Ser 


Asp 


Pro 


Lys 


Lys 


Val 


Tyr 


Thr 


He 


Asp 


Gly 


Arg 


Glu Phe 


Arg 








100 










105 










110 




Gly 


Lys 


Thr 


He 


He 


Val 


Ala 


Ser 


Gly 


Ala 


Met 


Glu 


Arg 


Ala Asn 


Lys 


115 










120 










125 






Phe 


Lys 


Gly 


Glu 


Glu 


Glu 


Phe 


Leu 


Gly 


Arg 


Gly 


Val 


Ser 


Tyr Cys 


Gly 




130 










135 










140 








Val 


Cys 


Asp 


Ala 


Ala 


Phe 


Phe 


Lys 


Asp 


Gin 


Pro 


Val 


Ala 


Val He 


Gly 
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145 150 155 160 

Asp Asp Asp Tyr Ala lie Glu Glu Ala Glu Phe lie Ala Arg Phe Ala 

165 170 175 

Asn Lys Val Phe Phe Val Val Pro Gly Ser Lys lie Lys Ala Pro Pro 

180 185 190 

Glu Val lie Glu His Phe Glu Lys Leu Pro Asn Val Glu lie Leu Leu 

195 200 205 

Arg His Arg Pro lie Glu lie Val Gly Asp Gin Val Val Lys Gly lie 

210 215 220 

Lys Leu Lys Asp Leu Glu Lys Lys Glu Glu Lys Leu Leu Glu Val Asn 
225 230 235 240 

Gly Val Phe He Phe Leu Gly Gly Thr Lys Pro Ser Val Asp Phe Leu 

245 250 255 

Met Gly Gin Val Glu Met Thr Glu Gly Asp Cys He Val Val Asn Glu 

260 265 270 

Glu Met Met Thr Ser Val Pr.o Gly Val Phe Ala Ala Gly Asp Val Leu 

275 280 285 

Cys Asn Glu Val Lys Gin Ala Val Val Ala Ala Ala Met Gly Cys Lys 

290 295 300 

Ala Ala Leu Ala Val Asp Lys Phe Leu Ser Gly Lys Lys Lys He Val 
305 310 315 320 

Pro Gin Trp 



<210> 198 
<211> 315 
<212> PRT 

<213> Bacillus subtilis 
<400> 198 

Ser Glu Glu Lys He Tyr Asp Val He He He Gly Ala Gly Pro Ala 

1 5 10 15 

Gly Met Thr Ala Ala Val Tyr Thr Ser Arg Ala Asn Leu Ser Thr Leu 

20 ' 25 30 

Met He Glu Arg Gly He Pro Gly Gly Gin Met Ala Asn Thr Glu Asp 

35 * 40 45 

Val Glu Asn Tyr Pro Gly Phe Glu Ser He Leu Gly Pro Glu Leu Ser 

50 55 60 

Asn Lys Met Phe Glu His Ala Lys Lys Phe Gly Ala Glu Tyr Ala Tyr 
65 70 75 80 

Gly Asp He Lys Glu Val He Asp Gly Lys Glu Tyr Lys Val Val Lys 

85 90 95 

Ala Gly Ser Lys Glu Tyr Lys Ala Arg Ala Val He He Ala Ala Gly 

100 * ~ 105 110 

Ala Glu Tyr Lys Lys He Gly Val Pro Gly Glu Lys Glu Leu Gly Gly 

115 120 ^ 125 

Arg Gly Val Ser Tyr Cys Ala Val Cys Asp Gly Ala Phe Phe Lys Gly 

130 - 135 140 

Lys Glu Leu Val Val Val Gly Gly Gly Asp Ser Ala Val Glu Glu Gly 
145 150 155 160 

Val Tyr Leu Thr Arg Phe Ala Ser Lys Val Thr He Val His Arg Arg 

165 170 175 

Asp Lys Leu Arg Ala Gin Ser He Leu Gin Ala Arg Ala Phe Asp Asn 

180 185 ~ 190 

Glu Lys Val Asp Phe Leu Trp Asn Lys Thr Val Lys Glu lie' His Glu 

195 200 205 

Glu Asn Gly Lys Val Gly Asn Val Thr Leu Val Asp Thr Val Thr Gly 

210 215 220 

Glu Glu Ser Glu Phe Lys Thr Asp Gly Val Phe He Tyr He Gly Met 
225 230 235 240 

Leu Pro Leu Ser Lys Pro Phe Glu Asn Leu Gly He Thr Asn Glu Glu 

245 250 255 

Gly Tyr He Glu Thr Asn Asp Arg Met Glu Thr Lys Val Glu Gly He 

260 265 270 

Phe Ala Ala Gly Asp He Arg Glu Lys Ser Leu Arg Gin He Val Thr 
275 280 285 
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Ala Thr Gly Asp Gly Ser lie Ala Ala Gin Ser Val Gin His Tyr Val 

290 295 300 

Glu Glu Leu Gin Glu Thr Leu Lys Thr Leu Lys 
305 . 310 315 



<210> 199 

<211> 326 

<212> PRT 

<213> Borrelia burgdorferi 



<400> 199 



Met 


Leu 


Glu 


Phe 


Glu 


Thr 


He 


Asp 


1 








5 








Leu 


Ser 


Gin Lys 


Glu 


Val 


ASD 


Phe 








20 










Ser 


Gly 


Pro 


Ala 


Gly 


Leu 


Thr 


Ala 






35 








40 


Tvr 


Lvs 


Ala Ala 


He 


Leu 


Glu 


Gly 




50 










55 




Thr 


Thr 


Thr 


Glu 


Val 


Tvr 


Asn 


Tvr 

JL 


65 










70 






Glv 


Arg 


Asn 


Leu 


Met 
85 


Leu 


Asn 


Met 


Ala 


Lys 


Thr 


Phe 


Pro 


Glu 


Thr 


Val 






100 










lie 


Phe 


Tyr 
115 


Leu 


Tvr 


Thr 


Glu 


Asn 
120 


lie 


lie 
130 


Ala 


Val 


Glv 


Ser 


Lys 
135 


Pro 


Ser 


Glv 


Leu 


Phe 


Trn 


Asn 


Lys 


Glv 


145 










150 






Glv 


His 


Leu 


Phe 


Lvs 
165 


Glv 


Lvs 


Arcr 


Thr 


Ala 


Leu 


Ser 
180 


Glu 


Ser 


He 


Tyr 


Tyr 


Leu 


lie 
195 


Val 


Arg 


Lys 


Asn 


Asn 
200 


Asp 


Ser 
210 


Val 


Ala 


Lys 


Leu 


Pro 
215 


Asn 


Ala 


lie 


Glu Val 


Asp 


Gly 


Lys 


Ser 


225 










230 






Asn 


Lys 


Lys Asp 


Asn 


Val 


Val 


Tyr 










245 








Met 


Ala 


Val Gly 


Tyr 


Lys 


Pro 


Asn 








260 










Asp 


Leu 


Asp Glu 


Glu 


Gly 


Phe 


He 






275 










280 


Ser 


Val 


Asp Gly 


Val 


Phe 


Ser 


Cys 




290 










295 




Ala 


Gin 


Ala 


He 


Thr 


Ala 


Ala 


Ala 


305 










310 






Leu 


Gly 


Asn 


Phe 


Leu 


Lys 







325 



He Asn Leu 


Thr 


Lys 


Lys 


Lys 


Asn 


10 








15 




He Glu Asp 


Val 


He 


He 


Val 


Gly 


25 






30 






Gly He Tyr 


Ser 


Val 


Met 


Ser 


Asn 




45 








Pro Glu Pro 


Gly 


Gly 


Gin 


Leu 


Thr 




60 










Pro Gly Phe 


Lvs 


Asn 


Gly 


He 


Ser 


75 










80 


Arg Glu Gin 


Val 


Val 


Asn 


Leu 


Gly 


90 








95 




Phe Ser He 


Lvs 


Arcr 


Lvs 


Gly 


Asn 


105 






110 






Tvr He Tvr 


Lys 


Ser 


Lvs 


Ala 


Val 






125 








Lys Lys Leu 


Glu 


Thr 


Leu 


Lvs 


Asn 


140 










He Ser Val 


Cys 


Ala 


He 


Cys 


Asn 


155 










160 


Val Ala Val 


He 


Gly 


Gly 


Gly 


Asn 


170 








175 




Leu Ser Lys 


Leu 


Val 


Asp 


Lys 


Val 


185 






190 






Leu Arg Ala 


He 


Ala 


Met 


Leu 


Arg 




205 








He Glu He 


Leu 


Tyr 


Asn 


Ser 


Glu 




220 










Ser Val Ser 


Ser 


Val 


Lys 


He 


Phe 


235 










240 


Glu Leu Glu 


Val 


Ser 


Ala 


Val 


Phe 


250 








255 




Thr Glu Phe 


Leu 


Lys 


Gly 


Phe 


Leu 


265 






270 






Val Thr Lys 


Asp 


Val 


Val 


Lys 


Thr 






285 








Gly Asp Val 


Ser 


Asn 


Lys 


Leu 


Tyr 




300 










Glu Gly Phe 


He 


Ala 


Ser 


Val 


Glu 


315 










320 



<210> 200 
<211> 319 
<212> PRT 

<213> Buchnera aphidicola 
<400> 200 

Met Asp Lys Val Lys His Ser Lys He He He Leu Gly Ser Gly Pro 

1 " * 5 10 15 

Ala Gly Tyr Thr Ala Ala He Tyr Ala Ala Arg Ala Asn Leu Asp Pro 

20 25 30 

Phe Leu He Thr Gly Thr Asn Lys Gly Gly Gin Leu Met Asn Thr Asn 
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35 




40 


45 




Glu 


He 


Glu Asn 


Trp Pro 


Gly Asp 


Tyr Asn Lys He Ser Gly Ser 


Glu 




50 






55 


60 




Leu 


Met 


Asn Arg 


Met Tyr 


Lys His 


Ala He Glu Leu Lys Thr Lys 


Val 


65 






70 




75 


80 


He 


Cys 


Asp Thr 


Val He 


Ser Val 


Asn Phe Lys Lys Asn Pro Phe 


Phe 






85 




90 1 " 95 




Leu 


He 


Gly Glu 


Asn Asn 


Lys Tyr 


Thr Ala Asp Ser Val He He 


Ala 






100 






105 ^ 110 




Thr 


Gly 


Ala Asn 


Pro Arg 


Tyr Leu 


Gly Leu Gin Ser Glu Ser Leu 


Phe 






115 




120 


125 




Lys 


Gly 


Lys Gly 


Val Ser 


Thr Cys 


Ala Val Cys Asp Gly Phe Phe 


Tyr 




130 






135 


140 




Lys 


Asn 


Lys Glu 


Val Ala 


Val Val 


Gly Gly Gly Asn Thr Ala He 


Glu 


145 






150 




155 


160 


Glu 


Thr 


Leu Tyr 


Leu Ser 


Asn Phe 


Val Lys Lys Val His Leu He 


His 






165 




170 175 




Arg 


Gly 


He Asn 


Phe Arg 


Ala Glu 


Lys He Leu Leu Asp Arg Leu 


Glu 






180 






185 " 190 




Lys 


Lys 


He Lys 


Ser Gin 


Lys He 


He He Tyr Leu Asn Ser He 


Val 






195 




200 


205 




Lys 


Asn 


He Leu 


Gly Asn 


Ser Ser 


Gly Val Thr Ala Leu Leu He 


Glu 




210 




215 


220 




Gin 


Lys 


Asn Ser 


Lys Glu 


Lys Thr 


Glu Ser Lys He Gin Val Ser 


Gly 


225 






. 230 




235 


240 


Leu 


Phe 


Val Ala 


He Gly 


Tyr Thr 


Pro Asn Thr Asn He Phe Val 


Asn 








245 


250 255 




Lys 


Leu 


Lys Met 


Lys Asp 


Gly Tyr 


He Gin Val Thr Arg Gin Glu 


His 






260 






265 270 




Gly 


Asn 


Tyr Thr 


Gin Thr 


Ser He 


Pro Gly He Phe Ala Ala Gly 


Asp 






275 




280 


285 




Val 


He 


Asp His 


Val Tyr 


Arg Gin 


Ala He Thr Ser Ser Ala Ser 


Gly 




290 






295 


300 




Cys 


Met 


Ala Ala 


Leu Asp 


Ser Glu 


Arg Tyr He Asn Ser Leu Val 




305 






310 




315 





<210> 201 
<211> 319 
<212> PRT 

<213> Buchnera aphidicola 
<400> 201 



Met 


Glu 


Leu 


Lys 


Asn 


His 


Lys Lys 


He He He Leu Gly Ser Gly 


Pro 


1 








5 






10 


15 




Ala 


Gly 


Tyr 


Thr 


Ala 


Ala 


lie Tyr 


Ser Ser Arg Ala Asn Leu Asn 


Pro 






20 








25 30 






Leu 


Leu 


He 


Thr 


Gly 


He 


Asn Lys 


Gly Gly Gin Leu Met Asn Thr 


Asn 






35 








40 


45 






Glu 


He 


Glu 


Asn 


Trp 


Pro 


Gly Asp 


Phe Lys Lys He Thr Gly Pro 


Glu 




50 










55 


60 






Leu 


Met 


Asn 


Arg 


Met 


His 


Glu His 


Ser Leu Lys Phe Lys Thr Glu 


He 


65 










70 




75 




80 


Val 


Tyr 


Asp 


Asn 


He 
85 


He 


Ser Val 


Glu Phe Lys Lys Lys Pro 
90 


Phe 
95 


Phe 


Leu 


Leu 


Gly 


Glu 
100 


Tyr 


Asn 


Lys Tyr 


Thr Cys Asp Ala Val lie- 
105 110 


He 


Ala 


Thr 


Gly 


Ala 
115 


Asn 


Pro 


Arg 


Tyr Leu 
120 


Gly Leu Ser Ser Glu Asn 
125 


Lys 


Phe 


Lys 


Gly 
130 


Lys 


Gly 


He 


Ser 


Thr Cys 
135 


Ala Val Cys Asp Gly Phe 
140 


Phe 


Tyr 


Lys 


Asn 


Lys 


Glu 


He 


Ala 


Val Val 


Gly Gly Gly Asn Thr Ala 


He 


Glu 


145 








150 




155 




160 


Glu 


Thr 


Leu 


Tyr 


Leu 


Ser 


Asn Phe 


Val Lys Lys He Tyr Leu 


He 


His 








165 






170 


175 




Arg 


Arg 


Asn 


Asn 
180 


Phe 


Lys 


Ala Glu 


Lys He Leu He Asp Arg 
185 190 


Leu 


Leu 
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Lys 


He 


Val Lys 


Thr Lys 


Lys Val 


He 






195 




200 




Glu 


Asp 


He Leu 


Gly Asn 


Asn Lys 


Gly 




210 






215 




Asn 


Lys 


Asn Leu 


Lys Glu 


Lys Lys 


Lys 


225 




230 






Leu 


Phe 


Val Ala 


He Gly 


Tyr He 


Pro 








245 






Gin 


Leu 


Lys Met 


Lys Asp 


Gly Tyr 


He 






260 






265 


Gly 


Asn 


Tyr Thr 


Gin Thr 


Asn He 


Pro 






275 




280 




Val 


He 


Asp His 


Val Tyr 


Arg Gin 


Ala 




290 






295 




Cys 


Met 


Ala Ala 


Leu Asp 


Ser Glu 


Arg 


305 






310 







<210> 202 
<211> 312 
<212> PRT 

<213> Chlamydia muridarum 
<400> 202 



Met Thr 


His Val 


Lys Leu 


Ala He 


He 


1 

Thr Ala 


Ala He 


5 

Tyr Ala 


Ser Arg 


Ala 




20 






25 


Glu Gly 


Phe Phe 


Ser Gly 


He Ala 


Glv 




35 




40 




Glu Val 


Glu Asn 


Phe Pro 


Gly Phe 


Pro 


50 






55 




Leu Met 


Glu Asn 


Met Lys 


Met Gin 


Ala 


65 




70 






He Ala 


Lys Asp 


He Thr 


Ser Val 


Asp 






85 






Leu Lys 


Ser Gly 


Glu Asp 


Thr Phe 


Thr 




100 






105 


Thr Gly 


Ala Ser 


Ala Lys 


Arg Leu 


Ser 




115 




120 




Glu Phe 


Trp Gin 


Lys Gly 


Val Thr 


Ala 


130 






135 




Ser Pro 


He Phe 


Arg Asp 


Arg Asp 


Leu 


145 




150 






Ser Ala 


Leu Glu 


Glu Ala 


Met Phe 


Leu 






165 






Phe Val 


Val His 


Arg Arg 


Asp Thr 


Leu 




180 






185 


Asn Lys 


Ala Gin 


Ala Asn 


Glu Lys 


He 


195 




200 




Val Val 


Lys He 


Leu Gly 


Asp Ser 


Leu 


210 






215 




Asn Asn 


Val Glu 


Lys Thr 


Thr Val 


Thr 


225 




230 






Phe Ala 


He Gly 


His Gin 


Pro Asn 


Thr 




245 






Ser Leu 


Asp Glu 


Asn Gly 


Tyr He 


He 




260 






265 


Thr Ser 


Val Pro 


Gly Val 


Phe Ala 


Ala 




275 




280 




Tyr Arg 


Gin Ala 


He Thr 


Ser Ala 


Gly 


290 






295 




Asp Ala 


Glu Arg 


Phe Leu 


Glu Lys 




305 




310 







<210> 203 



PCT/US01/50240 



Leu His 


Leu 


Asn Ser Thr 


He 






205 




Val Thr 


His 


Leu Leu He 


Lys 




220 






Leu Lys 


He 


Ala Val Ser 


Gly 


235 






240 


Asn Thr 


Asp 


He Phe Thr 


Asp 


250 




255 




Lys lie 


Lys 


Lys Gly Thr 


His 






270 




Gly Val 


Phe 


Ala Ala Gly 


Asp 






285 




lie Thr 


Ser 


Ser Ala Ser 


Gly 




300 






Tyr Leu 


Asn 


Ser Leu Ser 




315 








Gly Ser 


Gly 


Pro Ala Gly 


Tyr 


10 




15 




Leu Leu 


Thr 


Pro He Leu 


Phe 






30 




Gly Gin 


Leu 


Met Thr Thr 


Thr 




45 




Gin Gly 


Val 


Leu Gly His 


Gin 




60 






Gin Arg 


Phe 


Gly Thr Gin 


Val 


75 






80 


Phe Ser 


Val 


Arg Pro Phe 


Val 


90 




95 




Cys Asp 


Ala 


Cys He He 


Ala 






110 




He Pro 


Gly 


Ala Gly Asp 


Asn 






125 




Cys Ala 


Val 


Cys Asp Gly 


Ala 




140 






Phe Val 


He 


Gly Gly Gly 


Asp 


155 






160 


Thr Arg 


Tyr 


Gly Lys Arg 


Val 


170 




175 




Arg Ala 


Ser 


Lys Ala Met 


Val 






190 




Val Phe 


Leu 


Trp Asn Ser 


Glu 






205 




Val Arg 


Ser 


He Asp He 


Phe 




220 




Met Glu 


Ala 


Ala Gly Val 


Phe 


235 






240 


Ala Phe 


Leu 


Gly Gly Gin 


Leu 


250 




' 255 




Thr Glu 


Lys 


Gly Ser Ser 


Arg 






270 




Gly Asp 


Val 


Gin Asp Lys 


Tyr 






285 




Ser Gly 


Cys 


Met Ala Ala 


Leu 




300 
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<211> 311 
<212> PRT 

<213> Chlamydia pneumoniae 
<400> 203 

Met lie His Ser Arg Leu lie lie lie Gly Ser Gly Pro Ser Gly Tyr 

15 10 15 

Thr Ala Ala lie Tyr Ala Ser Arg Ala Leu Leu His Pro Leu Leu Phe 

20 ~ 25 30 

Glu Gly Phe Phe Ser Gly lie Ser Gly Gly Gin Leu Met Thr Thr Thr 

35 40 45 

Glu Val Glu Asn Phe Pro Gly Phe Pro Glu Gly He Leu Gly Pro Lys 

50 55 - 60 

Leu Met Asn Asn Met Lys Glu Gin Ala Val Arg Phe Gly Thr Lys Thr 
65 70 75 80 

Leu Ala Gin Asp He He Ser Val Asp Phe Ser Val Arg Pro Phe He 

85 90 95 

Leu Lys Ser Lys Glu Glu Thr Tyr Ser Cys Asp Ala Cys He He Ala 

100 105 110 

Thr Gly Ala Ser Ala Lys Arg Leu Glu He Pro Gly Ala Gly Asn Asp 

115 ~ 120 125 

Glu Phe Trp Gin Lys Gly Val Thr Ala Cys Ala Val Cys Asp Gly Ala 

130 135 140 

Ser Pro He Phe Lys Asn Lys Asp Leu Tyr Val He Gly Gly Gly Asp 
145 150 ~ 155 160 

Ser Ala Leu Glu Glu Ala Leu Tyr Leu Thr Arg Tyr Gly Ser His Val 

165 170 175 

Tyr Val Val His Arg Arg Asp Lys Leu Arg Ala Ser Lys Ala Met Glu 

180 ~ ~ <s 185 190 

Ala Arg Ala Gin Asn Asn Glu Lys He Thr Phe Leu Trp Asn Ser Glu 

195 200 205 

He Val Lys He Ser Gly Asp Ser He Val Arg Ser Val Asp He Lys 

210 "* 215 220 

Asn Val Gin Thr Gin Glu He Thr Thr Arg Glu Ala Ala Gly Val Phe 
225 230 235 240 

Phe Ala He Gly His Lys Pro Asn Thr Asp Phe Leu Gly Gly Gin Leu 

245 "* 250 255 

Thr Leu Asp Glu Ser Gly Tyr He Val Thr Glu Lys Gly Thr Ser Lys 

260 265 270 

Thr Ser Val Pro Gly Val Phe Ala Ala Gly Asp Val Gin Asp Lys Tyr 

275 280 285 

Tyr Arg Gin Ala Val Thr Ser Ala Gly Ser Gly Cys He Ala Ala Leu 

290 295 300 

Asp Ala Glu Arg Phe Leu Gly 
305 310 



<210> 204 
<211> 312 
<212> PRT 

<213> Chlamydia trachomatis 



<400> 204 



Met Thr 


His 


Ala 


Lys 


Leu 


Val He 


1 






5 






Thr Ala 


Ala 


He 


Tyr 


Ala 


Ser Arg 






20 








Glu Gly 


Phe 


Phe 


Ser 


Gly 


He Ala 




35 








40 


Glu Val 


Glu 


Asn 


Phe 


Pro 


Gly Phe 


50 










55 


Leu Met 


Asp 


Leu 


Met 


Lys 


Thr Gin 


65 






70 




Leu Ser 


Lys 


Asp 


He 


Thr 


Ala Val 








85 






Leu Lys 


Ser 


Gly 


Lys 


Glu 


Thr Phe 






100 









He Gly Ser Gly Pro Ala Gly Tyr 

10 15 
Ala Leu Leu Thr Pro Val' Leu Phe 
25 30 
Gly Gly Gin Leu Met Thr Thr Thr 
45 

Pro Glu Gly Val Leu Gly His Gin 
60 

Ala Gin Arg Phe Gly Thr Gin Val 

75 80 
Asp Phe Ser Val Arg Pro Phe Val 

90 95 
Thr Cys Asp Ala Cys He He Ala 
105 110 
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Thr 


Gly 


Ala 


Ser Ala 


Lvs 

*-*J o 


Arcr 


Leu 


Ser He 


Pro Gly Ala Gly Asp Asn 




115 




120 




125 


Glu 


Phe 


Trp 


Gin Lys 


Gly 


Val 


Thr 


Ala Cys 


Ala Val Cys Asp Gly Ala 




130 








135 






140 


Ser 


Pro 


He 


Phe Arg 


Asp 


Lys 


Asp 


Leu Phe 


Val Val Gly Gly Gly Asp 


145 








150 








155 160 


Ser 


Ala 


Leu 


Glu Glu 


Ala 


Met 


Phe 


Leu Thr 


Arg Tyr Gly Lys Arg Val 








165 








170 


175 


Phe 


Val 


Val 


His Arg 


Arg 


Asp 


Thr 


Leu Arg 


Ala Ser Lys Val Met Val 








180 






185 


190 


Asn 


Lys 


Ala 


Gin Ala 


Asn 


Glu 


Lys 


He Phe 


Phe Leu Trp Asn Ser Glu 






195 








200 




205 


He 


Val 


Lys 


He Ser 


Gly 


Asp 

XT 


Thr 


Leu Val 


Arg Ser He Asp He Tyr 




210 








215 






220 


Asn 


Asn 


Val 


Asp Glu 


Thr 


Thr 


Thr 


Thr Met 


Glu Ala Ala Gly Val Phe 


225 






230 








235 240 


Phe 


Ala 


He 


Gly His 


Gin 


Pro 


Asn 


Thr Ala 


Phe Leu Gly Gly Gin Val 








245 








250 


255 


Ala 


Leu 


Asp 


Glu Asn 


Gly 


Tyr 


He 


He Thr 


Glu Lys Gly Ser Ser Arg 








260 






265 


270 


Thr 


Ser 


Val 


Pro Gly 


Val 


Phe 


Ala 


Ala Gly 


Asp Val Gin Asp Lys Tyr 






275 








280 




285 


Tyr 


Arg 


Gin 


Ala He 


Thr 


Ser 


Ala 


Gly Ser 


Gly Cys Met Ala Ala Leu 




290 








295 






300 


Asp 


Ala 


Glu 


Arg Phe 


Leu 


Glu 


Asn 







305 310 



<210> 205 
<211> 315 
<212> PRT 

<213> Clostridium litorale 



<400> 205 



Met 


Glu 


Asn 


Val 


Tyr 


Asp 


He 


Ala 


He He 


Gly Ser Gly Pro Ala Gly 


1 








5 








10 


15 


Leu 


Ala 


Ala 


Ala 


Leu 


Tyr 


Gly 


Ala 


Arg Ala 


Lys Met Lys Thr Leu Leu 








20 








25 


30 


Leu 


Glu 


Gly 


Met 


Lys 


Val 


Gly 


Gly 


Gin He 


Val He Thr His Glu Val 






35 








40 




45 


Ala 


Asn 


Tyr 


Pro 


Gly 


Ser 


Val 


Pro 


Glu Ala 


Thr Gly Pro Ser Leu He 




50 






55 






60 


Gly 


Arg 


Met 


Glu 


Glu 


Gin 


Val 


Glu 


Glu Phe 


Gly Ala Glu Arg Val Met 


65 








70 








75 80 


Asp 


Asn 


He 


Val 


Asp 


Val 


Asp 


Phe 


Thr Asp 


Lys He Lys Val Leu Lys 










85 








90 


95 


Gly 


Ala 


Lys 


Gly 


Glu 


Tyr 


Lys 


Ala 


Lys Ala 


Val He Val Ala Thr Gly 








100 










105 


110 


Ala 


Ser 


Pro 


Lys 


Leu 


Ala 


Gly 


Cys 


Pro Gly 


Glu Lys Glu Leu Thr Gly 






115 










120 




125 


Lys 


Gly 


Val 


Ser 


Tyr 


Cys 


Ala 


Thr 


Cys Asp 


Ala Asp Phe Phe Glu Asp 




130 










135 






140 


Met 


Glu 


Val 


Phe 


Val 


He 


Gly 


Gly 


Gly Asp 


Thr Ala Val Glu Glu Ala 


145 










150 






155 160 


Met 


Phe 


Leu 


Thr 


Lys 


Phe 


Ala 


Arg 


Lys Val 


Thr He Val His Arg Arg 










165 








170 


' 175 


Ala 


Glu 


Leu 


Arg 


Ala 


Ala 


Lys 


Ser 


He Gin 


Glu Lys Ala Phe Lys Asn 








180 








185 


190 


Glu 


Lys 


Leu 


Asn 


Phe 


Met 


Trp 


Asn 


Thr Val 


He Glu Glu He Lys Gly 






195 








200 




205 


Asp 


Gly 


He 


Val 


Glu 


Ser 


Ala 


Val 


Phe Lys 


Asn Arg Glu Thr Gly Glu 




210 










215 






220 


Val 


Thr 


Glu 


Phe 


Val 


Ala 


Pro 


Glu 


Glu Asp 


Gly Thr Phe Gly He Phe 


225 










230 








235 240 


Val 


Phe 


He 


Gly 


Tyr 


Asp 


Pro 


Lys 


Ser Ala 


Leu Val Glu Gly Lys Leu 










245 








250 


255 


Glu 


Leu 


Asp 


Glu 


Thr 


Gly 


Tyr 


He 


Pro Thr 


Asp Asp Asn Met Lys Thr 
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260 265 270 

Asn Val Glu Gly Val Phe Ala Ala Gly Asp He Arg Val Lys Ser Leu 

275 280 285 

Arg Gin Val Val Thr Ala Thr Ala Asp Gly Ala He Ala Ala Val Gin 

290 295 300 

Ala Glu Lys Tyr He Glu Glu Leu Phe Ala Glu 
305 310 315 



<210> 206 
<211> 321 
<212> PRT 

<213> Coxiella burnetii 
<400> 206 

Met Asn Lys Pro Gin His His Ser Leu He He Leu Gly Ser Gly Pro 

1 5 10 15 

Ala Gly Tyr Thr Asp Ala He Tyr Val Ala Arg Ala Asn Leu Lys Pro 

20 25 30 

He Met He Thr Gly Met Glu Gin Gly Gly Gin Leu Met Thr Thr Thr 

35 40 45 

Asp Val Ala Asn Trp Pro Gly Glu Ala Pro Gly Leu Gin Gly Pro Lys 

50 55 60 

Leu Leu Glu Arg Met Gin Lys His Ala Gly Gly Ala Leu Asn Thr Gin 
-65 70 75 80 

Phe He Phe Asp His He Asn Lys Pro Asp Leu Asn Pro Arg Pro Phe 

85 90 95 

Leu Leu Gin Gly Asp Asn Ala Thr Tyr Ser Cys Asp Ala Leu He He 

100 105 110 

Ala Thr Gly Ala Ser Ala Arg Tyr Leu Gly Leu Pro Ser Glu Lys Pro 

115 120 125 

Tyr Met Gly Lys Gly Val Ser Ala Cys Ala Thr Cys Asp Gly Phe Phe 

130 135 140 

Tyr Arg Ala Lys Lys Val Ala Val Val Gly Gly Gly Asn Thr Ser Val 
145 150 155 160 

Glu Glu Ala Leu Tyr Leu Ser His He Ala Ser His Val Thr Leu He 

165 170 175 

His Arg Arg Asp Lys Leu Arg Ala Glu Lys Met Leu Ser Ala Gin Leu 

180 " 185 * 190 

He Lys Lys Val Glu Glu Gly Lys Val Ala He Val Trp Ser His Val 

195 200 205 

He Glu Glu Val Leu Gly Asp Asp Gin Gly Val Thr Gly Val His Leu 

210 215 220 

Lys His Val Lys Glu Glu Lys Thr Gin Asp Leu Thr He Asp Gly Leu 
225 230 235 240 

Phe He Ala He Gly His Asp Pro Asn Thr Lys He Phe Lys Glu Gin 

245 250 255 

Leu Glu Met Asp Glu Ala Gly Tyr Leu Arg Ala Lys Ser Gly Leu Gin 

260 ~ 265 270 

Gly Asn Ala Thr Ala Thr Asn He Pro Gly Val Phe Pro Ala Val Val 

275 280 285 

Val Arg Gly Gin Leu Tyr Arg Gin Thr He Ala Ala Ala Gly Met Gly 

290 295 300 

Cys Met Pro Ala Leu Asp Ala Glu Arg Tyr Leu Asp Ser Leu Asn Gin 
305 310 315 320 

Ala 



<210> 207 
<211> 320 
<212> PRT 

<213> Escherichia coli 
<400> 207 

Gly Thr Thr Lys His Ser Lys Leu Leu He Leu Gly Ser Gly Pro Ala 
1 5 10 15 
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Gly Tyr 

Leu lie 

Val Glu 

50 
Met Glu 
65 

Phe Asp 

Asn Gly 

Gly Ala 

Gly Arg 
130 
Asn Gin 
145 

Ala Leu 

Arg Asp 

Lys Val 

Glu Val 
210 
Thr Gin 
225 

Val Ala 

Glu Leu 

Ala Thr 

Asp His 
290 
Ala Ala 
305 



Thr Ala Ala 
20 

Thr Gly Met 
35 

Asn Trp Pro 

Arg Met His 

His lie Asn 
85 

Asp Asn Gly 

100 
Ser Ala Arg 
115 

Gly Val Ser 

Lys Val Ala 

Tyr Leu Ser 
165 

Gly Phe Arg 

180 
Glu Asn Gly 
195 

Thr Gly Asp 

Asn Ser Asp 

He Gly His 
245 

Glu Asn Gly 

260 
Gin Thr Ser 
275 

He Tyr Arg 
Leu Asp Ala 



Val Tyr 

Glu Lys 

Gly Asp 

55 
Glu His 
70 

Lys Val 

Glu Tyr 

Tyr Leu 

Ala Cys 
135 
Val He 
150 

Asn He 

Ala Glu 

Asn He 

Gin Met 
215 
Asn He 
230 

Ser Pro 

Tyr lie 

lie Pro 

Gin Ala 
295 
Glu Arg 
310 



Ala Ala 

25 
Gly Gly 
40 

Pro Asn 

Ala Thr 

Asp Leu 

Thr Cys 
105 
Gly Leu 
120 

Ala Thr 

Gly Gly 

Ala Ser 

Lys He 
185 
He Leu 
200 

Gly Val 

Glu Ser 

Asn Thr 

Lys Val 
265 
Gly Val 
280 

He Thr 
Tyr Leu 



Arg Ala Asn Leu 
Gin Leu 
Asp Leu 



Lys Phe 

75 
Gin Asn 
90 

Asp Ala 

Pro Ser 

Cys Asp 

Gly Asn 
155 
Glu Val 
170 

Leu He 

His Thr 

Thr Gly 

Leu Asp 
235 
Ala He 
250 

Gin Ser 

Phe Ala 

Ser Ala 

Asp Gly 
315 



Thr Thr 

45 
Thr Gly 
60 

Glu Thr 



Arg Pro 

Leu He 

Glu Glu 
125 
Gly Phe 
140 

Thr Ala 

His Leu 

Lys Arg 

Asn Arg 
205 
Val Arg 
220 

Val Ala 

Phe Glu 

Gly He 

Ala Gly 
285 
Gly Thr 
300 

Leu Ala 



Gin Pro Val 
30 

Thr Thr Glu 

Pro Leu Leu 

Glu He He 
80 

Phe Arg Leu 
95 

He Ala Thr 
110 

Ala Phe Lys 

Phe Tyr Arg 

Val Glu Glu 
160 

He His Arg 

175 
Leu Met Asp 
190 

Thr Leu Glu 

Leu Arg Asp 

Gly Leu Phe 
240 

Gly Gin Leu 

255 
His Gly Asn 
270 

Asp Val Met 

Gly Cys Met 

Asp Ala Lys 
320 



<210> 208 
<211> 315 
<212> PRT 

<213> Eubacterium acidaminophilum 



<400> 208 



Met 


Glu 


Asn 


Val 


Tyr 


Asp 


Leu 


Ala 


He He Gly Ser Gly Pro Ala Gly 


1 








5 








10 15 


Leu 


Ala 


Ala 


Ala 


Leu 


Tyr 


Gly 


Ala 


Arg Ala Lys Met Lys Thr He Met 








20 










25 30 


He 


Glu 


Gly 


Gin 


Lys 


Val 


Gly 


Gly 


Gin He Val He Thr His Glu Val 






35 










40 


45 


Ala 


Asn 


Tyr 


Pro 


Gly 


Ser 


Val 


Arg 


Glu Ala Thr Gly Pro Ser Leu He 




50 










55 




60 


Glu 


Arg 


Met 


Glu 


Glu 


Gin 


Ala 


Asn 


Glu Phe Gly Ala Glu Lys Val Met 


65 










70 






75 ' 80 


Asp 


Lys 


He 


Val 


Asp 


Val 


Asp 


Leu 


Asp Gly Lys He Lys Val He Lys 










85 








90 95 


Gly 


Glu 


Lys 


Ala 


Glu 


Tyr 


Lys 


Ala 


Lys Ser Val He Leu Ala Thr Gly 








100 










105 110 


Ala 


Ala 


Pro 


Arg 


Leu 


Ala 


Gly 


Cys 


Pro Gly Glu Gin Glu Leu Thr Gly 






115 








120 


125 


Lys 


Gly 


Val 


Ser 


Tyr 


Cys 


Ala 


Thr 


Cys Asp Ala Asp Phe Phe Glu Asp 




130 










135 




140 


Met 


Glu 


Val 


Phe 


Val 


Val 


Gly 


Gly 


Gly Asp Thr Ala Val Glu Glu Ala 


145 










150 


155 160 


Met 


Tyr 


Leu 


Ala 


Lys 


Phe 


Ala 


Arg 


Lys Val Thr He Val His Arg Arg 
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165 



Asp 


Glu 


Leu Arg Ala 


Ala 


Lys 


Ser 






180 








Pro 


Lys 


Leu Asp Phe 


Met 


Trp 


Asn 






195 






200 


Asp 


Gly 


He Val Glu 


Ser 


Ala 


Val 




210 






215 




Thr 


Thr 


Glu Tyr Phe 


Ala 


Asn 


Glu 


225 






230 






Val 


Phe 


He Gly Tyr 


He 


Pro 


Lys 






245 








Thr 


Leu 


Asp Asp Ala 


Gly 


Tyr 


He 






260 








Asn 


Val 


Glu Gly Val 


Phe 


Ala 


Ala 






275 






280 


Arg 


Gin 


Val Val Thr 


Ala 


Cys 


Ala 




290 






295 




Ala 


Glu 


Lys Tyr Val 


Glu 


Ala 


Asn 


305 






310 







170 175 
He Gin Glu Lys Ala Phe Lys Asn 
185 190 
Ser Ala lie Glu Glu He Lys Gly 
205 

Phe Lys Asn Leu Val Thr Gly Glu 
220 

Glu Asp Gly Thr Phe Gly He Phe 
235 240 
Ser Asp Val Phe Lys Gly Lys He 

250 255 
He Thr Asp Asp Asn Met Lys Thr 
265 270 
Gly Asp He Arg Val Lys Ser Leu 
285 

Asp Gly Ala He Ala Ala Thr Gin 
300 

Phe Glu Glu 
315 



<210> 209 
<211> 318 
<212> PRT 

<213> Haemophilus influenzae 
<400> 209 

Met Ser Asp He Lys His Ala Lys Leu Leu He Leu Gly Ser Gly Pro 

15 10 15 

Ala Gly Tyr Thr Ala Ala He Tyr Ala Ala Arg Ala Asn Leu Lys Pro 

20 25 30 

Val Leu Val Thr Gly Leu Gin Gin Gly Gly Gin Leu Thr Thr Thr Asp 

35 40 45 

Glu lie Glu Asn Trp Pro Gly Asp Phe Glu Met Thr Thr Gly Ser Gly 

50 55 60 

Leu Met Gin Arg Met Leu Gin His Ala Glu Lys Phe Glu Thr Glu He 
65 70 75 80 

Val Phe Asp His He Asn Arg Val Asp Leu Ser Ser Arg Pro Phe Lys 

85 90 95 

Leu Phe Gly Asp Val Gin Asn Phe Thr Cys Asp Ala Leu He He Ala 

100 105 110 

Thr Gly Ala Ser Ala Arg Tyr He Gly Leu Pro Ser Glu Glu Asn Tyr 

115 120 125 

Lys Gly Arg Gly Val Ser Ala Cys Ala Thr Cys Asp Gly Phe Phe Tyx 

130 135 140 

Arg Asn Lys Pro Val Gly Val He Gly Gly Gly Asn Thr Ala Val Glu 
145 150 155 160 

Glu Ala Leu Tyr Leu Ala Asn He Ala Ser Thr Val . His Leu He His 

165 170 175 

Arg Arg Asp Ser Phe Arg Ala Glu Lys He Leu He Asp Arg Leu Tyr 

180 185 190 

Lys Lys Val Glu Glu Gly Lys He Val Leu His Thr Asp Arg Thr Leu 

195 200 205 

Asp Glu Val Leu Gly Asp Asn Met Gly Val Thr Gly Leu Arg Leu Ala 

210 215 220 

Asn Thr Lys Thr Gly Glu Lys Glu Glu Leu Lys Leu Asp Gly- Leu Phe 
225 230 235 240 

Val Ala He Gly His Ser Pro Asn Thr Glu He Phe Gin Gly Gin Leu 

245 250 255 

Glu Leu Asn Asn Gly Tyr He Val Val Lys Ser Gly Leu Asp Gly Asn 

260 265 270 

Ala Thr Ala Thr Ser Val Glu Gly Val Phe Ala Ala Gly Asp Val Met 

275 . 280 285 

Asp His Asn Tyr Arg Gin Ala He Thr Ser Ala Gly Thr Gly Cys Met 

290 295 300 

Ala Ala Leu Asp Ala Glu Arg Tyr Leu Asp Ala Gin Glu Ala 
305 310 315 
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<210> 210 
<211> 311 
<2l2> PRT 

<213> Helicobacter pylori 
<400> 210 

Met lie Asp Cys Ala lie lie Gly Gly Gly Pro Ala Gly Leu Ser Ala 

15 10 15 

Gly Leu Tyr Ala Thr Arg Gly Gly Val Lys Asn Ala Val Leu Phe Glu 

20 25 30 

Lys Gly Met Pro Gly Gly Gin lie Thr Gly Ser Ser Glu lie Glu Asn 

35 40 45 

Tyr Pro Gly Val Lys Glu Val Val Ser Gly Leu Asp Phe Met Gin Pro 

50 55 - 60 

Trp Gin Glu Gin Cys Phe Arg Phe Gly Leu Lys His Glu Met Thr Ala 
65 70 75 80 

lie Gin Arg Val Ser Lys Lys Gly Ser His Phe Val lie Leu Ala Glu 

85 90 95 

Asp Gly Lys Thr Phe Glu Ala Lys Ser Val lie lie Ala Thr Gly Gly 

100 105 110 

Ser Pro Lys Arg Thr Gly lie Lys Gly Glu Ser Glu Tyr Trp Gly Lys 

115 120 125 

Gly Val Ser Thr Cys Ala Thr Cys Asp Gly Phe Phe Tyr Lys Asn Lys 

130 135 140 

Glu Val Ala Val Leu Gly Gly Gly Asp Thr Ala Val Glu Glu Ala He 
145 150 155 160 

Tyr Leu Ala Asn He Cys Lys Lys Val Tyr Leu He His Arg Arg Asp 

165 170 175 

Gly Phe Arg Cys Ala Pro He Thr Leu Glu His Ala Lys Asn Asn Ser 

180 185 190 

Lys He Glu Phe Leu Thr Pro Tyr Val Val Glu Glu He Lys Gly Asp 

195 200 205 

Ala Ser Gly Val Ser Ser Leu Ser He Lys Asn Thr Ala Thr Asn Glu 

210 215 220 

Lys Arg Glu Leu Val Val Pro Gly Leu Phe He Phe Val Gly Tyr Asp 
225 230 " 235 240 

Val Asn Asn Ala Val Leu Lys Gin Glu Asp Asn Ser Met Leu Cys Glu 

245 250 255 

Cys Asp Glu Tyr Gly Ser He Val Val Asp Phe Ser Met Lys Thr Asn 

260 265 270 

Val Gin Gly Leu Phe Ala Ala Gly Asp He Arg He Phe Ala Pro Lys 

275 280 " 285 

Gin Val Val Cys Ala Ala Ser Asp Gly Ala Thr Ala Ala Leu Ser Val 

290 295 300 

He Ser Tyr Leu Glu His His 
305 310 



<210> 211 
<211> 311 
<212> PRT 

<213> Helicobacter pylori 



<400> 211 



Met He Asp 


Cys 


Ala 


He 


He Gly 


1 




5 






Gly Leu Tyr 


Ala 


Thr 


Arg 


Gly Gly 




20 








Lys Gly Met 


Pro 


Gly 


Gly 


Gin He 


35 








40 


Tyr Pro Gly 


Val 


Lys 


Glu 


Val Val 


50 








55 


Trp Gin Glu 


Gin 


Cys 


Phe 


Arg Phe 


65 






70 




Val Gin Arg 


Val 


Ser 


Lys 


Lys Asp 



85 



Gly 


Gly Pro Ala 


Gly Leu Ser Ala 




10 






15 




Val 


Lys Asn Ala 


Val 


Leu 


Phe 


Glu 


25 




30 






Thr 


Gly Ser Ser 


Glu 


He 


Glu 


Asn 




45 








Ser 


Gly Leu Asp 


Phe 


Met 


Gin 


Pro 




60 










Gly 


Leu Lys His 


Glu 


Met 


Thr 


Ala 




75 








80 


Ser 


His Phe Val 


He 


Leu 


Ala 


Glu 




90 






95 
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Asp Gly Lys Thr Phe Glu Ala Lys Ser Val lie lie Ala Thr Gly Gly 

100 105 110 

Ser Pro Lys Arg Thr Gly lie Lys Gly Glu Ser Glu Tyr Trp Gly Lys 

115 120 125 

Gly Val Ser Thr Cys Ala Thr Cys Asp Gly Phe Phe Tyr Lys Asn Lys 

130 135 140 

Glu Val Ala Val Leu Gly Gly Gly Asp Thr Ala Val Glu Glu Ala He 
145 150 155 160 

Tyr Leu Ala Asn He Cys Lys Lys Val Tyr Leu He His Arg Arg Asp 

165 170 175 

Gly Phe Arg Cys Ala Pro He Thr Leu Glu His Ala Lys Asn Asn Asp 

180 185 190 

Lys He Glu Phe Leu Thr Pro Tyr Val Val Glu Glu He Lys Gly Asp 

195 200 205 

Ala Ser Gly Val Ser Ser Leu Ser He Lys Asn Thr Ala Thr Asn Glu 

210 215 220 

Lys Arg Glu Leu Val Val Pro Gly Phe Phe He Phe Val Gly Tyr Asp 
225 230 235 240 

Val Asn Asn Ala Val Leu Lys Gin Glu Asp Asn Ser Met Leu Cys Lys 

245 250 255 

Cys Asp Glu Tyr Gly Ser lie Val Val Asp Phe Ser Met Lys Thr Asn 

260 " 265 270 

Val Gin Gly Leu Phe Ala Ala Gly Asp He Arg He Phe Ala Pro Lys 

275 280 285 

Gin Val Val Cys Ala Ala Ser Asp Gly Ala Thr Ala Ala Leu Ser Val 

290 295 300 

He Ser Tyr Leu Glu His His 
305 310 



<210> 212 

<211> 319 

<212> PRT 

<213> Listeria monocytogenes 



<400> 212 



Met 


Ala 


Ser 


Glu 


Glu 


Lys 


He 


Tyr Asp Val He He He Gly Ala Gly 


1 








5 






10 15 


Pro 


Ala 


Gly 


Met 


Thr 


Ala 


Ala 


Leu Tyr Thr Ser Arg Ala Asp Leu Asp 








20 








25 30 


Thr 


Leu 


Met 


He 


Glu 


Arg 


Gly 


Val Pro Gly Gly Gin Met Val Asn Thr 






35 






40 45 


Ala 


Glu 


Val 


Glu 


Asn 


Tyr 


Pro 


Gly Phe Asp Ser He Leu Gly Pro Asp 




50 










55 


60 


Leu 


Ser 


Asp 


Lys 


Met 


Leu 


Ser 


Gly Ala Lys Gin Phe Gly Ala Glu Tyr 


65 










70 




75 80 


Ala 


Tyr 


Gly 


Asp 


He 


Lys 


Glu 


Val Val Asp Gly Lys Glu Phe Lys Thr 










85 






90 95 


Val 


Thr 


Ala 


Gly 


Ser 


Lys 


Thr 


Tyr Lys Ala Arg Ala He He He Ala 








100 








105 110 


Thr 


Gly 


Ala 


Glu 


His 


Arg 


Lys 


Leu Gly Ala Ala Gly Glu Glu Glu Leu 






115 










120 125 


Ser 


Gly 


Arg 


Gly 


Val 


Ser 


Tyr 


Cys Ala Val Cys Asp Gly Ala Phe Phe 




130 










135 


140 


Lys 


Asn 


Arg 


Glu 


Leu 


lie 


Val 


Val Gly Gly Gly Asp Ser Ala Val Glu 


145 










150 




155 160 


Glu 


Gly 


Thr 


Tyr 


Leu 


Thr 


Arg 


Tyr Ala Asp Lys Val Thr He Val His 










165 






170 175 


Arg 


Arg 


Asp 


Lys 


Leu 


Arg 


Ala 


Gin Gin He Leu Gin Asp Arg Ala Phe 








180 








185 " 190 


Lys 


Asp 


Glu 


Lys 


Val 


Asp 


Phe 


He Trp Asn Ser Thr Val Glu Glu He 






195 










200 205 


Val 


Gly 


Asp 


Gly 


Lys 


Lys 


Val 


Thr Gly Ala Lys Leu Val Ser Thr Val 




210 










215 


220 


Asp 


Gly 


Ser 


Glu 


Ser 


He 


Met 


Pro Val Asp Gly Val Phe He Tyr Val 


225 










230 




235 240 


Gly 


Leu 


Val 


Pro 


Leu 


Thr. 


Lys 


Ala Phe Leu Asn Leu Gly He Thr Asp 
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245 



250 



255 



Asp Glu Gly Tyr He Val Thr Asp Glu Glu Met Arg Thr Asn Leu Pro 

260 265 270 

Gly He Phe Ala Ala Gly Asp Val Arg Ala Lys Ser Leu Arg Gin He 

275 280 285 

Val Thr Ala Thr Gly Asp Gly Gly Leu Ala Gly Gin Asn Ala Gin Lys 

290 295 " 300 

Tyr Val Glu Glu Leu Lys Glu Ser Leu Glu Ala Glu Ala Ala Lys 



<210> 213 
<211> 315 
<212> PRT 

<213> Mycoplasma genitalium 
<400> 213 

Met Leu Lys Val Asn Ala Asp Phe Leu Thr Lys Asp Gin Val He Tyr 

15 10 15 

Asp Leu Val He Val Gly Ala Gly Pro Ala Gly He Ala Ser Ala He 

20 25 30 

Tyr Gly Lys Arg Ala Asn Leu Asn Leu Ala He He Glu Gly Asn Thr 

35 " 40 45 

Pro Gly Gly Lys He Val Lys Thr Asn He Val Glu Asn Tyr Pro Gly 

50 ~ 55 60 

Phe Lys Thr He Thr Gly Pro Glu Leu Gly Leu Glu Met Tyr Asn His 
65 70 75 80 

Leu Leu Ala Phe Glu Pro Val Val Phe Tyr Asn Asn Leu He Lys He 

85 90 95 

Asp His Leu Asn Asp Thr Phe He Leu Tyr Leu Asp Asn Lys Thr Thr 

100 105 ' 110 

Val Phe Ser Lys Thr Val He Tyr Ala Thr Gly Met Glu Glu Arg Lys 

115 120 125 

Leu Gly He Glu Lys Glu Asp Tyr Phe Tyr Gly Lys Gly He Ser Tyr 

130 135 140 

Cys Ala He Cys Asp Ala Ala Leu Tyr Lys Gly Lys Thr Val Gly Val 
145 150 155 160 

Val Gly Gly Gly Asn Ser Ala He Gin Glu Ala He Tyr Leu Ser Ser 

165 170 175 

He Ala Lys Thr Val His Leu He His Arg Arg Glu Val Phe Arg Ser 

180 185 190 

Asp Ala Leu Leu Val Glu Lys Leu Lys Lys He Ser Asn Val Val Phe 

195 200 ~ * 205 

His Leu Asn Ala Thr Val Lys Gin Leu He Gly Gin Glu Lys Leu Gin 

210 215 220 

Thr Val Lys Leu Ala Ser Thr Val Asp Lys Ser Glu Ser Glu He Ala 
225 230 ^ 235 240 

He Asp Cys Leu Phe Pro Tyr He Gly Phe Glu Ser Asn Asn Lys Pro 

245 250 255 

Val Leu Asp Leu Lys Leu Asn Leu Asp Gin Asn Gly Phe He Leu Gly 

260 265 270 

Asp Glu Asn Met Gin Thr Asn He Lys Gly Phe Tyr Val Ala Gly Asp 

275 280 "* ~ 285 

Cys Arg Ser Lys Ser Phe Arg Gin He Ala Thr Ala He Ser Asp Gly 

290 295 300 

Val Thr Ala Val Leu Lys Val Arg Asp Asp He 
305 310 315 



<210> 214 
<211> 458 
<212> PRT 

<213> Mycobacterium leprae 
<400> 214 

Met Asn Thr Thr Pro Ser Ala His Glu Thr He His Glu Val He Val 
15 10 15 



305 



310 



315 
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lie Gly Ser Gly Pro Ala Gly Tyr Thr Ala Ala Leu Tyr Ala Ala Arg 

20 25 30 

Ala Gin Leu Thr Pro Leu Val Phe Glu Gly Thr Ser Phe Gly Gly Ala 

35 40 45 

Leu Met Thr Thr Thr Glu Val Glu Asn Tyr Pro Gly Phe Arg Asn Gly 

50 55 60 

lie Thr Gly Pro Glu Leu Met Asp Asp Met Arg Glu Gin Ala Leu Arg 
65 70 75 80 

Phe Gly Ala Glu Leu Arg Thr Glu Asp Val Glu Ser Val Ser Leu Arg 

85 90 95 

Gly Pro He Lys Ser Val Val Thr Ala Glu Gly Gin Thr Tyr Gin Ala 

100 105 110 

Arg Ala Val He Leu Ala Met Gly Thr Ser Val Arg Tyr Leu Gin He 

115 120 125 

Pro Gly Glu Gin Glu Leu Leu Gly Arg Gly Val Ser Ala Cys Ala Thr 

130 ' 135 " " 140 

Cys Asp Gly Ser Phe Phe Arg Gly Gin Asp He Ala Val He Gly Gly 
145 150 ~ * 155 • 160 

Gly Asp Ser Ala Met Glu Glu Ala Leu Phe Leu Thr Arg Phe Ala Arg 

165 170 175 

Ser Val Thr Leu Val His Arg Arg Asp Glu Phe Arg Ala Ser Lys He 

180 185 190 

Met Leu Gly Arg Ala Arg Asn Asn Asp Lys He Lys Phe He Thr Asn 

195 200 205 

His Thr Val Val Ala Val Asn Gly Tyr Thr Thr Val Thr Gly Leu Arg 

210 215 220 

Leu Arg Asn Thr Thr Thr Gly Glu Glu Thr Thr Leu Val Val Thr Gly 
225 230 235 240 

Val Phe Val Ala He Gly His Glu Pro Arg Ser Ser Leu Val Ser Asp 

245 250 255 

Val Val Asp He Asp Pro Asp Gly Tyr Val Leu Val Lys Gly Arg Thr 

260 265 270 

Thr Ser Thr Ser Met Asp Gly Val Phe Ala Ala Gly Asp Leu Val Asp 

275 280 285 

Arg Thr Tyr Arg Gin Ala He Thr Ala Ala Gly Ser Gly Cys Ala Ala 

290 ' 295 300 

Ala He Asp Ala Glu Arg Trp Leu Ala Glu His Ala Gly Ser Lys Ala 
305 310 315 320 

Asn Glu Thr Thr Glu Glu Thr Gly Asp Val Asp Ser Thr Asp Thr Thr 

325 " ~ 330 ~ 335 

Asp Trp Ser Thr Ala Met Thr Asp Ala Lys Aen Ala Gly Val Thr He 

340 ^ 345 350 

Glu Val Thr Asp Ala Ser Phe Phe Ala Asp Val Leu Ser Ser Asn Lys 

355 360 365 

Pro Val Leu Val Asp Phe Trp Ala Thr Trp Cys Gly Pro Cys Lys Met 

370 375 380 

Val Ala Pro Val Leu Glu Glu lie Ala Ser Glu Gin Arg Asn Gin Leu 
385 390 395 400 

Thr Val Ala Lys Leu Asp Val Asp Thr Asn Pro Glu Met Ala Arg Glu 
' 405 410 415 

Phe Gin Val Val Ser He Pro Thr Met He Leu Phe Gin Gly Gly Gin 

420 425 430 

Pro Val Lys Arg He Val Gly Ala Lys Gly Lys Ala Ala Leu Leu Arg 

435 440 445 

Asp Leu Ser Asp Val Val Pro Asn Leu Asn 
450 455 



<210> 215 
<211> 315 
<212> PRT 

<213> Mycoplasma pneumoniae 
<400> 215 

Met Leu Lys Val Lys Ser Asp Phe Leu Thr Lys Asp Gin Val He Tyr 

1 5 10 15 

Asp Val Ala He Val Gly Ala Gly Pro Ala Gly He Ala Ala Gly He 
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20 25 • 30 



Tyr 


Gly 


Lys Arg Ala 


Asn 


Leu 


Asn 


Leu Ala He 


He Glu Gly Ser Thr 




35 










40 




45 


Pro 


Gly 


Gly Lys Val 


Val 


Lys 


Thr 


Asn He Val 


Glu As*n. Tyr Pro Gly 




50 










55 






60 


Tyr 


Lys 


Ser 


lie 


Thr 


Gly 


Pro 


Asp 


Leu Gly Leu 


Glu Met Tyr Asn His 


65 










70 






75 


80 


Leu 


lie 


Asp 


Leu 


Glu 


Pro 


Thr 


Phe 


Phe Tyr Ala 


Asn Leu He Lys Leu 










85 








90 


95 


Asp 


Lys 


Ala Ala Asp 


Thr 


Phe 


He 


Leu Tyr Leu 


Asp Asp Lys Thr Val 








100 










105 


110 


Val 


Phe 


Ala Lys Thr 


Val 


He 


Tyr 


Ala Thr Gly 


Met Leu Glu Arg Lys 






115 










120 




125 


Leu 


Gly 


Val 


Ala 


Lys 


Glu 


Asp 


His 


Phe Tyr Gly 


Lys Gly He Ser Tyr 




130 










135 






140 


Cys 


Ala 


lie Cys Asp 


Gly 


Ser 


Leu 


Tyr Lys Asp 


Gin Val Val Gly Val 


145 










150 






155 


160 


Val 


Gly 


Gly Gly Asn 


Ser 


Ala 


He 


Gin Glu Ala 


Leu Tyr Leu Ala Ser 








165 








170 


175 


Met 


Ala 


Lys 


Thr 


Val 


His 


Leu 


He 


His Arg Arg 


Glu Gly Phe Arg Ala 








180 










185 


190 


Asp 


Glu 


Thr 


Ala 


Leu 


Asn 


Lys 


Leu 


Arg Asn Leu 


Pro Asn Val Val Phe 






195 








200 


205 


His 


Leu 


Asn Tyr Thr 


Val 


Lys 


Glu 


Leu Leu Gly 


Asn Asn Thr Leu Asn 




210 










215 






220 


Gly 


lie 


Val 


Leu 


Gin 


Asn 


Thr 


Leu 


Asp His Ser 


Thr Lys Gin He Asp 


225 










230 






235 


240 


Leu 


Asn 


Cys Val 


Phe 


Pro 


Tyr 


He 


Gly Phe Glu 


Ser He Thr Lys Pro 










245 






250 


255 


Val 


Glu 


His 


Leu 


Asn 


Leu 


Lys 


Leu 


Asp Pro Gin 


Gly Phe Leu He Thr 








260 










265 


270 


Asn 


Glu 


Gin Met 


Glu 


Thr 


Ser 


Leu 


Lys Gly Leu 


Phe Ala Ala Gly Asp 






275 










280 




285 


Cys 


Arg 


Ser Lys His 


Phe 


Arg 


Gin 


He Gly Thr 


Ala He Asn Asp Gly 




290 










295 






300 


lie 


lie 


Ala 


Val 


Leu 


Thr 


He 


Arg 


.rlJajJ V ell IJCU 




305 










310 




315 




<210> 216 
















<211> 311 
















<212> PRT 
















<213> Mycobacterium 


smegmatis 






<400> 216 
















Met 


Ser 


Thr 


Ser 


Gin 


Thr 


Val 


His 


Asp Val He 


He He Gly Ser Gly 


1 








5 








10 


15 


Pro 


Ala 


Gly 


Tyr 


Thr 


Ala 


Ala 


He 


Tyr Ala Ala 


Arg Ala Gin Leu Lys 








20 










25 


30 


Pro 


Leu 


Val 


Phe 


Glu 


Gly 


Thr 


Gin 


Phe Gly Gly 


Ala Leu Met Thr Thr 






35 










40 




45 


Thr 


Glu 


Val 


Glu 


Asn 


Tyr 


Pro 


Gly 


Phe Arg Glu 


Gly He Thr Gly Pro 




50 










55 






60 


Glu 


Leu 


Met 


Asp 


Gin 


Met 


Arg 


Glu 


Gin Ala Leu 


Arg Phe Arg Ala Asp 


65 










70 






75 


80 


Leu 


Arg 


Met 


Glu 


Asp 


Val 


Asp 


Ala 


Val Gin Leu 


Glu Gly Pro Val Lys 










85 








90 


95 


Thr 


Val 


Val 


Val 


Gly 


Asp 


Glu 


Thr 


His Gin Ala 


Arg Ala Val He Leu 








100 








105 


110 


Ala 


Met 


Gly 


Ala 


Ala 


Ala 


Arg 


His 


Leu Gly Val 


Pro Gly Glu Glu Ala 






115 








120 


125 


Leu 


Thr 


Gly 


Met 


Gly 


Val 


Ser 


Thr 


Cys Ala Thr 


Cys Asp Gly Phe Phe 




130 










135 






140 


Phe 


Arg 


Asp 


Gin 


Asp 


He 


Val 


Val 


Val Gly Gly 


Gly Asp Ser Ala Met 


145 










150 






155 


160 


Glu 


Glu 


Ala 


Thr 


Phe 


Leu 


Thr 


Arg 


Phe Ala Arg 


Ser Val Thr Leu He 










165 






170 


175 
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His 


Arg 


Arg Asp 


Glu Phe 


Arg 


Ala 






180 








Arg 


Ala 


Asn Glu 


Lys He 


Thr 


Phe 






195 




200 


He 


Glu 


Gly Asp 


Pro Lys 


Val 


Thr 




210 




215 




Thr 


Gly 


Glu Glu 


Ser Lys 


Leu 


Asp 


225 






230 






Gly 


His 


Asp Pro 


Arg Ser 


Glu 


Leu 








245 






Asp 


Glu 


Gly Tyr 


Val Lys 


Val 


Gin 






260 








Asp 


Gly 


Val Phe 


Ala Ala 


Gly 


Asp 






275 






280 


Ala 


He 


Thr Ala 


Ala Gly 


Ser 


Gly 




290 






295 




Arg 


Trp 


Leu Ala 


Glu Gin 


Asp 




305 






310 







Ser Lys He Met Leu Glu Arg Ala 
185 190 
Leu Thr Asn Thr Glu He Thr Gin 
205 

Gly Val Arg Leu Arg Asp Thr Val 
220 

Val Thr Gly Val Phe Val Ala He 
235 240 
Val Arg Gly Gin Val Glu Leu Asp 

250 . 255 

Gly Arg Thr Thr Tyr * Thr Ser Leu 
265 270 
Leu Val Asp His Thr Tyr Arg Gin 
285 

Cys Ala Ala Ser He Asp Ala Glu 
300 



<210> 217 
<211> 335 
<212> PRT 

<213> Mycobacterium tuberculosis 
<400> 217 

Met Thr Ala Pro Pro Val His Asp Arg Ala His His Pro Val Arg Asp 

15 10 15 

Val He Val He Gly Ser Gly Pro Ala Gly Tyr Thr Ala Ala Leu Tyr 

20 25 30 

Ala Ala Arg Ala Gin Leu Ala Pro Leu Val Phe Glu Gly Thr Ser Phe 

35 40 45 

Gly Gly Ala Leu Met Thr Thr Thr Asp Val Glu Asn Tyr Pro Gly Phe 

50 55 60 

Arg Asn Gly He Thr Gly Pro Glu Leu Met Asp Glu Met Arg Glu Gin 
65 70 75 ~ 80 

Ala Leu Arg Phe Gly Ala Asp Leu Arg Met Glu Asp Val Glu Ser Val 

85 90 95 

Ser Leu His Gly Pro Leu Lys Ser Val Val Thr Ala Asp Gly Gin Thr 

100 105 110 

His Arg Ala Arg Ala Val He Leu Ala Met Gly Ala Ala Ala Arg Tyr 

115 120 125 

Leu Gin Val Pro Gly Glu Gin Glu Leu Leu Gly Arg Gly Val Ser Ser 

130 135 140 

Cys Ala Thr Cys Asp Gly Phe Phe Phe Arg Asp Gin Asp He Ala Val 
145 150 155 160 

He Gly Gly Gly Asp Ser Ala Met Glu Glu Ala Thr Phe Leu Thr Arg 

165 170 175 

Phe Ala Arg Ser Val Thr Leu Val His Arg Arg Asp Glu Phe Arg Ala 

180 185 190 

Ser Lys He Met Leu Asp Arg Ala Arg Asn Asn Asp Lys He Arg Phe 

195 200 205 

Leu Thr Asn His Thr Val Val Ala Val Asp Gly Asp Thr Thr Val Thr 

210 215 220 

Gly Leu Arg Val Arg Asp Thr Asn Thr Gly Ala Glu Thr Thr Leu Pro 
225 230 235 , 240 

Val Thr Gly Val Phe Val Ala He Gly His Glu Pro Arg Ser Gly Leu 

245 250 255 

Val Arg Glu Ala He Asp Val Asp Pro Asp Gly Tyr Val Leu Val Gin 

260 265 " 270 

Gly Arg Thr Thr Ser Thr Ser Leu Pro Gly Val Phe Ala Ala Gly Asp 

275 280 285 

Leu Val Asp Arg Thr Tyr Arg Gin Ala Val Thr Ala Ala Gly Ser Gly 

290 295 300 

Cys Ala Ala Ala He Asp Ala Glu Arg Trp Leu Ala Glu His Ala Ala 
305 310 315 320 

Thr Gly Glu Ala Asp Ser Thr Asp Ala Leu He Gly Ala Gin Arg 
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325 



<210> 218 
<211> 334 
<212> PRT 

<213> Neurospora crassa 



<400> 218 



Met 


His 


Ser 


Lys Val 


Val He He 


Gly 


1 
Ala 


Ala 


He 


5 

Tyr Leu 


Ala Arg Ala 


Glu 








20 




25 


Gly 


Phe 


Met 


Ala Asn 


Gly He Ala 


Ala 






35 




40 . 




Thr 


Glu 


He 


Glu Asn 


Phe Pro Gly 


Phe 




50 






55 




Glu 


Leu 


Met 


Asp Lys 


Met Lys Ala 


Gin 


65 








70 




lie 


He 


Ser 


Glu Thr 


Val Ala Lys 


Val 








85 






Lys 


Tyr 


Ala 


Thr Glu 


Trp Ser Pro 


Glu 








100 




105 


lie 


He 


Leu 


Ala Thr 


Gly Ala Ser 


Ala 






115 




120 




Glu 


Glu 


Lys 


Tyr Trp 


Gin Asn Gly 


He 




130 






135 




Gly 


Ala 


Val 


Pro He 


Phe Arg Asn 


Lys 


145 








150 




Gly 


Asp 


Ser 


Ala Ala 


Glu Glu Ala 


Met 








165 






His 


Val 


Thr 


Val Leu 


Val Arg Lys 


Asp 








180 




185 


Met 


Ala 


His 


Arg Leu 


Leu Asn His 


Glu 






195 


200 




Thr 


Val 


Gly 


Val Glu 


Val Lys Gly 


Asp 




210 






215 




Leu 


Val 


Val 


Lys Asp 


Val Thr Thr 


Gly 


225 








230 




Asn 


Gly 


Leu 


Phe Tyr 


Ala He Gly 


His 








245 






Lys 


Gly 


Gin 


Leu Glu 


Thr Asp Ala 


Asp 








260 




265 


Gly 


Thr 


Thr 


Leu Thr 


Ser Val Glu 


Gly 




275 




280 




Gin 


Asp 


Lys 


Arg Tyr 


Arg Gin Ala 


He 




290 






295 




Met 


Ala 


Ala 


Leu Asp 


Ala Glu Lys 


Phe 


305 








310 




Pro 


Ala 


Glu 


His Arg 


Asp Thr Ser 


Ala 








325 







<210> 219 
<211> 333 
<212> PRT 

<213> Penicillium chrysogenum 
<400> 219 

Val His Ser Lys Val Val He He Gly 

1 5 
Ala Ala He Tyr Leu Ser Arg Ala Glu 

20 25 
Gly Met Leu Ala Asn Gly Thr Ala Ala 

35 40 
Thr Asp Val Glu Asn Phe Pro Gly Phe 
50 55 



PCT/US01/50240 



330 










335 




Ser 


Gly 


Pro 


Ala 


Ala 


His 


Thr 


10 








15 




Leu 


Lys 


Pro 


Val 


Leu 


Tyr 


Glu 










30 






Gly 


Gly 


Gin 


Leu 


Thr 


Thr 


Thr 








45 








Pro 


Asp 


Gly 


He 


Met 


Gly Gin 






60 










Ser 


Glu 


Arg 


Phe 


Gly 


Thr 


Gin 




75 










80 


Asp 


Leu 


Ser 


Ala 


Arg 


Pro 


Phe 


90 










95 




Glu 


Tyr 


His 


Thr 


Ala 


Asp 


Ser 










110 






Arg 


Arg 


Leu 


His 


Leu 


Pro Gly 








125 








Ser 


Ala 


Cys 


Ala 


Val 


Cys 


Asp 






140 










His 


Leu 


Val 


Val 


He 


Gly 


Gly 




155 










160 


Tyr 


Leu 


Thr 


Lys 


Tyr 


Gly 


Ser 


170 










175 




Lys 


Leu 


Arg 


Ala 


Ser 


Ser 


He 










190 






Lys 


Val 


Thr 


Val 


Arg 


Phe 


Asn 








205 








Asp 


Lys 


Gly 


Leu 


Met 


Ser 


His 






220 










Lys 


Glu 


Glu 


Thr 


Leu 


Glu 


Ala 


235 










240 


Asp 


Pro 


Ala 


Thr 


Ala 


Leu 


Val 


250 










255 




Gly 


Tyr 


Val 


Val 


Thr 


Lys 


Pro 










270 






Val 


Phe 


Ala 


Ala 


Gly 


Asp 


Val 








285 








Thr 


Ser 


Ala 


Gly 


Thr 


Gly 


Cys 






300 










Leu 


Ser 


Glu 


His 


Glu 


Glu 


Thr 




315 










320 


Val 


Gin 


Gly 


Asn 


Leu 






330 














Ser 


Gly 


A±a 


Gly 


Ala 


His 


Thr 


10 










15 




Leu 


Gin 


Pro 


Val 


Leu 


Tyr 


Glu 










30 






Gly 


Gly 


Gin 


Leu 


Thr 


Thr 


Thr 








45 








Pro 


Ser 


Gly 


He 


Gly 


Gly 


Ala 
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Glu Leu Met Asp Asn Met Arg Ala Gin Ser Glu Arg Phe Gly Thr Glu 
65 70 75 80 

lie He Thr Glu Thr He Ser Lys Leu Asp Leu Ser Ser Arg Pro Phe 

85 90 95 

Lys Met Trp Thr Glu Trp Asn Asp Asp Glu Gly Ser Glu Pro Val Arg 

100 105 " 110 

Thr Ala Asp Ala Val He He Ala Thr Gly Ala Asn Ala Arg Arg Leu 

115 120 125 

Asn Leu Pro Gly Glu Glu Thr Tyr Trp Gin Asn Gly He Ser Ala Cys 

130 135 140 

Ala Val Cys Asp Gly Ala Val Pro He Phe Arg Asn Lys Pro Leu Tyr 
145 150 155 160 

Val He Gly Gly Gly Asp Ser Ala Ala Glu Glu Ala Met Phe Leu Ala 

165 170 175 

Lys Tyr Gly Ser Ser Val Thr Val Leu Val Arg Lys Asp Lys Leu Arg 

180 185 190 

Ala Ser Asn He Met Ala Asp Arg Leu Leu Ala His Pro Lys Cys Lys 

195 ^ 200 205 

Val Arg Phe Asn Thr Val Ala Thr Glu Val lie Gly Glu Asn Lys Pro 

210 215 220 

Asn Gly Leu Met Thr His Leu Arg Val Lys Asp Val Leu Ser Asn Ala 
225 230 235 240 

Glu Glu Val Val Glu Ala Asn Gly Leu Phe Tyr Ala Val Gly His Asp 

245 250 255 

Pro Ala Ser Gly Leu Val Lys Gly Gin Val Glu Leu Asp Asp Glu Gly 

260 265 270 

Tyr He He Thr Lys Pro Gly Thr Ser Phe Thr Asn Val Glu Gly Val 

275 ~ 280 285 

Phe Ala Cys Gly Asp Val Gin Asp Lys Arg Tyr Arg Gin Ala He Thr 

290 295 " A 300 

Ser Ala Gly Ser Gly Cys Val Ala Ala Leu Glu Ala Glu Lys Phe He 
305 310 315 320 

Ala Glu Thr Glu Thr His Gin Glu Ala Lys Pro Val Leu 
325 330 



<210> 220 
<211> 310 
<212> PRT 

<213> Rickettsia prowazekii 
<400> 220 

Met Lys He Thr Thr Lys Val Leu He He Gly Ser Gly Pro Ala Gly 

1 5 10 15 

Leu Ser Ala Ala He Tyr Thr Ala Arg Ser Ala Leu Lys Pro He Leu 

20 25 30 

He Asn Gly Met Gin Pro Gly Gly Gin Leu Thr Met Thr Thr Asp Val 

35 40 45 

Glu Asn Tyr Pro Gly Phe Ala Glu Thr He Gin Gly Pro Trp Leu Met 

50 55 60 

Glu Gin Met Ser Met Gin Ala Lys Asn Val Gly Thr Glu He He Ser 
65 70 75 80 

Asp Tyr Val Glu Arg Val Asp Leu Ser Lys Arg Pro Phe Lys He Phe 

85 90 95 

Thr Gly Thr Gly Asn Glu Tyr Glu Ala Asp Ser He He He Cys Thr 

100 105 110 - 

Gly Ala Glu Ser Lys Trp Leu Gly He Ala Ser Glu Gin Glu Phe Arg 

115 120 125 

Gly Phe Gly Val Ser Ser Cys Ala He Cys Asp Gly Phe Phe Phe Lys 

130 135 140 

Asn Gin Glu He Val Val Val Gly Gly Gly Asn Ser Ala Leu Glu Glu 
145 150 " 155 160 

Ala Leu Tyr Leu Thr Asn His Ala Asn Lys Val Thr Val Val His Arg 

165 170 175 

Arg Asn Ser Phe Arg Ala Glu Lys He Leu Gin Asp Arg Leu Phe Lys 

180 185 190 

Asn Pro Lys He Ser Val He Trp Asp His He He Asp Glu He Val 
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195 








200 


Gly 


Ser 


Asn Lys 


Pro 


Lys Ala 


Val 




210 








215 




Tyr 


Thr 


Asn Glu 


He 


Asn 


Leu 


Val 


225 








230 






He 


Gly 


His Ala 


Pro 


Asn 


Thr 


Ala 








245 








Asp 


Asp 


Asp Asn Tyr 


He 


Val 


Thr 






260 










Val 


Glu 


Gly Val 


Phe 


Ala 


Ala 


Gly 






275 








280 


Gin 


Ala 


Val Thr 


Ala 


Ala 


Ala 


Ser 




290 








295 




Ala 


Lys 


Phe Leu 


Asn 


Lys 






305 








310 







205 

Thr Gly Val Lys He Gin Asn Val 
220 

Asn Cys Ser Gly Val Phe He Ala 
235 240 
Leu Phe Lys Gly Gin He Ala He 

250 ~ " 255 
Gin Ser Gly Ser Thr Arg Thr Asn 
265 270 
Asp Val Gin Asp Lys He Tyr Arg 
285 

Gly Cys Met Ala Ala Leu Glu Val 
300 



<210> 221 
<211> 322 
<212> PRT 

<213> Schizosaccharomyces pombe 
<400> 221 

Met Thr His Asn Lys Val Val He He Gly Ser Gly Pro Ala Gly His 

15 10 15 

Thr Ala Ala He Tyr Leu Ala Arg Gly Glu Leu Lys Pro Val Met Tyr 

20 25 3-0 

Glu Gly Met Leu Ala Asn Gly He Ala Ala Gly Gly Gin Leu Thr Thr 

35 40 45 

Thr Thr Asp Val Glu Asn Phe Pro Gly Phe Pro Asp Gly He Asn Gly 

50 55 60 

Thr Thr Leu Thr Glu Asn Phe Arg Ala Gin Ser Leu Arg Phe Gly Thr 
65 70 75 80 

Glu He He Thr Glu Thr Val Ser Lys Leu Asp Leu Ser Ser Arg Pro 

85 90 95 

Phe Lys Tyr Trp Leu Glu Gly Ala Glu Glu Glu Glu Pro His Thr Ala 

100 105 110 

Asp Ser Val He Leu Ala Thr Gly Ala Ser Ala Arg Arg Leu His He 

115 120 125 

Thr Gly Glu Asp Thr Tyr Trp Gin Ala Gly He Ser Ala Cys Ala Val 

130 135 140 

Cys Asp Gly Ala Val Pro He Tyr Arg Asn Lys Pro Leu Ala Val Val 
145 150 155 160 

Gly Gly Gly Asp Ser Ala Ala Glu Glu Ala Gin Phe Leu Thr Lys Tyr 

165 170 175 

Gly Ser Lys Val Tyr Val Leu Val Arg Arg Asp Lys Leu Arg Ala Ser 

180 185 190 

Pro He Met Ala Lys Arg Leu Leu Ala Asn Pro Lys Val Glu Val Leu 

195 200 205 

Trp Asn Thr Val Ala Glu Glu Ala Gin Gly Asp Gly Lys Leu Leu Asn 

210 215 220 

Asn Leu Arg He Lys Asn Thr Asn Thr Asn Glu Val Ser Asp Leu Gin 
225 230 235 240 

Val Asn Gly Leu Phe Tyr Ala He Gly His He Pro Ala Thr Lys Leu 

245 250 255 

Val Ala Glu Gin He Glu Leu Asp Glu Ala Gly Tyr He Lys Thr He 

260 265 ~ 270 

Asn Gly Thr Pro Arg Thr Ser He Pro Gly Phe Phe Ala Ala Gly Asp 

275 280 285 

Val Gin Asp Lys Val Phe Arg Gin Ala He Thr Ser Ala Gly Ser Gly 

290 295 300 

Cys Gin Ala Ala Leu Leu Ala Met His Tyr Leu Glu Glu Leu Glu Asp 
305 310 315 320 

Thr Asp 
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<210> 222 
<211> 321 
<212> PRT 

<213> Streptomyces clavuligerus 
<400> 222 

Ser Asp Val Arg Asn Val lie He He Gly Ser Gly Pro Ala Gly Tyr 

15 10 ~ 15 

Thr Ala Ala Leu Tyr Thr Ala Arg Ala Ser Leu Gin Pro Leu Val Phe 

20 25 30 

Glu Gly Ala Val Thr Ala Gly Gly Ala Leu Met Asn Thr Thr Asp Val 

35 40 45 

Glu Asn Phe Pro Gly Phe Arg Asp Gly He Met Gly Pro Asp Leu Met 

50 55 60 

Asp Asn Met Arg Ala Gin Ala Glu Arg Phe Gly Ala Glu Leu He Pro 
65 70 " 75 80 

Asp Asp Val Val Ser Val Asp Leu Thr Gly Asp He Lys Thr Val Thr 

85 90 95 

Asp Ser Ala Gly Thr Val His Arg Ala Lys Ala Val He Val Thr Thr 

100 105 110 

Gly Ser Gin His Arg Lys Leu Gly Leu Pro Arg Glu Asp Ala Leu Ser 

115 120 125 

Gly Arg Gly Val Ser Trp Cys Ala Thr Cys Asp Gly Phe Phe Phe Lys 

130 135 140 

Asp Gin Asp He Val Val Val Gly Gly Gly Asp Thr Ala Met Glu Glu 
145 150 155 160 

Ala Thr Phe Leu Ser Arg Phe Ala Lys Ser Val Thr He Val His Arg 

165 170 175 

Arg Asp Ser Leu Arg Ala Ser Lys Ala Met Gin Asp Arg Ala Phe Ala 

180 185 190 

Asp Pro Lys He Ser Phe Ala Trp Asn Ser Glu Val Ala Thr He His 

195 200 205 

Gly Glu Gin Lys Leu Thr Gly Leu Thr Leu Arg Asp Thr Lys Thr Gly 

210 215 220 

Glu Thr Arg Glu Leu Ala Ala Thr Gly Leu Phe He Ala Val Gly His 
225 230 235 240 

Asp Pro Arg Thr Glu Leu Phe Lys Gly Gin Leu Asp Leu Asp Asp Glu 

245 250 255 

Gly Tyr Leu Lys Val Ala Ser Pro Ser Thr Arg Thr Asn Leu Thr Gly 

260 265 270 

Val Phe Ala Ala Gly Asp Val Val Asp His Thr Tyr Arg Gin Ala He 

275 280 285 

Thr Ala Ala Gly Thr Gly Cys Ser Ala Ala Leu Asp Ala Glu Arg Tyr 

290 295 300 

Leu Ala Ala Leu Ala Asp Ser Glu Gin He Ala Glu Pro Ala Pro Ala 
305 310 315 320 

Val 



<210> 223 
<211> 321 
<212> PRT 

<213> Streptomyces coelicolor 
<400> 223 

Ser Asp Val Arg Asn Val He He He Gly Ser Gly Pro Ala Gly Tyr 

1 5 10 15 

Thr Ala Ala Leu Tyr Thr Ala Arg Ala Ser Leu Lys Pro Leu Val Phe 

20 25 " 30 

Glu Gly Ala Val Thr Ala Gly Gly Ala Leu Met Asn Thr Thr Glu Val 

35 40 45 

Glu Asn Phe Pro Gly Phe Gin Asp Gly He Met Gly Pro Glu Leu Met 

50 55 60 

Asp Asn Met Arg Ala Gin Ala Glu Arg Phe Gly Ala Glu Leu He Pro 
65 70 75 80 

Asp Asp Val Val Ala Val Asp Leu Ser Gly Glu He Lys Thr Val Thr 
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Asp Thr Ala 

Gly Ser Gin 
115 

Gly Arg Gly 

130 
Asp Gin Asp 
145 

Ala Thr Phe 

Arg Asp Thr 

Asp Pro Lys 
195 

Gly Asp Gin 

210 
Glu Leu Ser 
225 

Asp Pro Arg 

Gly Tyr Leu 

Val Phe Gly 
275 

Thr Ala Ala 

290 
Leu Ala Ala 
305 
Val 



85 

Gly Thr Val 
100 

His Arg Lys 

Val Ser Trp 

lie Ala Val 
150 

Leu Ser Arg 

165 
Leu Arg Ala 
180 

lie Ser Phe 

Lys Leu Ala 

Asp Leu Pro 
230 

Thr Glu Leu 

245 
Lys Val Asp 
260 

Ala Gly Asp 

Gly Thr Gly 

Leu Ser Asp 
310 



His Arg 

Leu Gly 
120 
Cys Ala 
135 

lie Gly 

Phe Ala 

Ser Lys 

Val Trp 
200 
Gly Leu 
215 

Val Thr 

Phe Lys 

Ala Pro 

Val Val 
280 
Cys Ser 
295 

Glu Asp 



90 

Ala Lys 
105 

Leu Pro 

Thr Cys 

Gly Gly 

Lys Ser 
170 
Ala Met 
185 

Asp Ser 

Lys Leu 

Gly Leu 

Gly Gin 
250 
Ser Thr 
265 

Asp His 
Ala Ala 
Lys Ala 



Ala Val He 

Asn Glu Asp 
125 

Asp Gly Phe 

140 
Asp Thr Ala 
155 

Val Thr He 

Gin Glu Arg 

Glu Val Ala 
205 

Arg Asn Val 

220 
Phe He Ala 
235 

Leu Asp Leu 

Arg Thr Asn 

Thr Tyr Arg 
285 

Val Asp Ala 

300 
Glu Pro Glu 
315 



95 

Val Thr Thr 
110 

Ala Leu Ser 

Phe Phe Lys 

Met Glu Glu 
160 

Val His Arg 
175 

Ala Phe Ala 
190 

Glu Val Gin 

Lys Thr Gly 

He Gly His 
240 

Asp Pro Glu 

255 
Leu Thr Gly 
270 

Gin Ala He 

Glu Pro Phe 

Lys Thr Ala 
320 



<210> 224 
<211> 307 
<212> PRT 

<213> Treponema pallidiutn 



<400> 224 



Met 


Glu 


Thr 


Asp 


Tyr 


Asp 


Val 


He He Val Gly Ala Gly Ala 


Ala Gly 


1 






5 






10 


15 


Leu 


Ser 


Ala 


Ala 


Gin 


Tyr 


Ala 


Cys Arg Ala Asn Leu Arg Thr 


Leu Val 








20 








25 30 




He 


Glu 


Ser 


Lys 


Ala 


His 


Gly 


Gly Gin Ala Leu Leu He Asp 


Ser Leu 






35 










40 45 




Glu 


Asn 


Tyr 


Pro 


Gly 


Tyr 


Ala 


Thr Pro He Ser Gly Phe Glu 


Tyr Ala 




50 










55 


60 




Glu 


Asn 


Met 


Lys 


Lys 


Gin 


Ala 


Val Ala Phe Gly Ala Gin He 


Ala Tyr 


65 






70 




75 


80 


Glu 


Glu 


Val 


Thr 


Thr 


He 


Gly 


Lys Arg Asp Ser Val Phe His 


He Thr 










85 




90 


95 


Thr 


Gly 


Thr 


Gly 


Ala 


Tyr 


Thr 


Ala Met Ser Val He Leu Ala 


Thr Gly 






100 






105 110 




Ala 


Glu 


His 


Arg 


Lys 


Met 


Gly 


He Pro Gly Glu Ser Glu Phe 


Leu Gly 






115 










120 125 




Arg 


Gly 


Val 


Ser 


Tyr 


Cys 


Ala 


Thr Cys Asp Gly Pro Phe Phe' 


Arg Asn 




130 










135 


140 




Lys 


His 


Val 


Val 


Val 


He 


Gly 


Gly Gly Asp Ala Ala Cys Asp 


Glu Ser 


145 










150 




155 


160 


Leu 


Val 


Leu 


Ser 


Arg 


Leu 


Thr 


Asp Arg Val Thr Met He His 


Arg Arg 










165 






170 


175 


Asp 


Thr 


Leu 


Arg 


Ala 


Gin 


Lys 


Ala He Ala Glu Arg Thr Leu 


Lys Asn 








180 








185 190 




Pro 


His 


He 


Ala 


Val 


Gin 


Trp 


Asn Thr Thr Leu Glu Ala Val 


Arg Gly 






195 










200 205 




Glu 


Thr 


Lys 


Val 


Ser 


Ser 


Val 


Leu Leu Lys Asp Val Lys Thr 


Gly Glu 




210 










215 


220 
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Thr Arg Glu 


Leu 


Ala Cys 


Asp Ala 


225 




230 




Pro lie Thr 


Gly 


Leu Leu 


Pro Asp 






245 




lie Val Thr 


Asp 


Asp Glu 


Met Arg 




260 




Ala Gly Asp 


Val 


Arg Ala 


Lys Ser 


275 






280 


Ser Asp Gly 


Ala 


Leu Ala 


Ala His 


290 






295 



Leu Gin Asn 



305 



Val Phe Phe Phe He Gly Met Val 
235 240 
Ala Glu Lys Asp Ser Thr Gly Tyr 

250 255 
Thr Ser Val Glu Gly He Phe Ala 
265 270 
Phe Arg Gin Val He Thr Ala Thr 
285 

Ala Ala Ala Ser Tyr He Asp Thr 
300 



<210> 225 
<211> 45 
<212> PRT 

<213> Vibrio fischeri 
<400> 225 

Met Asn Val Lys His Ser Lys Leu Leu He Leu Gly Ser Gly Pro Ala 

1 5 10 15 

Gly Tyr Thr Ala Ala Val Tyr Ala Ala Arg Ala Asn Leu Asn Pro Val 

20 25 30 

Met He Thr Gly Met Gin Gin Gly Gly Gin Leu Thr Asn 
35 40 45 



<210> 226 
<211> 318 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 226 

Val His Asn Lys Val Thr He He Gly Ser Gly Pro Ala Ala His Thr 

15 10 15 

Ala Ala He Tyr Leu Ala Arg Ala Glu He Lys Pro He Leu Tyr Glu 

20 25 30 

Gly Met Met Ala Asn Gly He Ala Ala Gly Gly Gin Leu Thr Thr Thr 

35 40 45 

Thr Glu He Glu Asn Phe Pro Gly Phe Pro Asp Gly Leu Thr Gly Ser 

50 55 60 

Glu Leu Met Asp Arg Met Arg Glu Gin Ser Thr Lys Phe Gly Thr Glu 
65 70 75 80 

lie He Thr Glu Thr Val Ser Lys Val Asp Leu Ser Ser Lys Pro Phe 

85 90 95 

Lys Leu Trp Thr Glu Phe Asn Glu Asp Ala Glu Pro Val Thr Thr Asp 

100 105 110 

Ala lie He Leu Ala Thr Gly Ala Ser Ala Lys Arg Met His Leu Pro 

115 120 ~ 125 

Gly Glu Glu Thr Tyr Trp Gin Lys Gly He Ser Ala Cys Ala Val Cys 

130 135 140 

Asp Gly Ala Val Pro He Phe Arg Asn Lys Pro Leu Ala Val He Gly 
145 150 155 160 

Gly Gly Asp Ser Ala Cys Glu Glu Ala Gin Phe Leu Thr Lys Tyr Gly 

165 170 ,175 

Ser Lys Val Phe Met Leu Val Arg Lys Asp His Leu Arg Ala Ser Thr 

180 185 & 190 

He Met Gin Lys Arg Ala Glu Lys Asn Glu Lys He Glu He Leu Tyr 

195 200 205 

Asn Thr Val Ala Leu Glu Ala Lys Gly Asp Gly Lys Leu Leu Asn Ala 

210 _ 215 220 

Leu Arg He Lys Asn Thr Lys Lys Asn Glu Glu Thr Asp Leu Pro Val 
225 230 235 240 

Ser Gly Leu Phe Tyr Ala He Gly His Thr Pro Ala Thr Lys He Val 

245 250 255 

Ala Gly Gin Val Asp Thr Asp Glu Ala Gly Tyr He Lys Thr Val Pro 
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260 

Gly Ser Ser Leu Thr Ser Val Pro 
275 280 
Gin Asp Ser Lys Tyr Arg Gin Ala 

290 295 
Met Ala Ala Leu Asp Ala Glu Lys 
305 310 



265 270 
Gly Phe Phe Ala Ala Gly Asp Val 
285 

lie Thr Ser Ala Gly Ser Gly Cys 
300 

Tyr Leu Thr Ser Leu Glu 
315 



<210> 227 
<211> 342 
<212> PRT 

<213> Saccharomyces cerevisiae 



<400> 227 



Met 


lie 


Lvs 


His 


He 


Val 


Ser Pro Phe Arg Thr Asn 


Phe Val Gly He 


1 






5 




10 


15 


Ser 


Lvs 


Ser 


Val 


Leu 


Ser 


Arg Met He His His Lys 


Val Thr He He 






20 






25 


30 


Glv 


Ser 


Glv 


Pro 


Ala 


Ala 


His Thr Ala Ala He Tyr 


Leu Ala Arg Ala 




35 








40 


45 


Glu 


Met 


Lys 


Pro 


Thr 


Leu 


Tyr Glu Gly Met Met Ala 


Asn Gly He Ala 




50 










55 60 




Ala 


Glv 


Glv 


Gin 


Leu 


Thr 


Thr Thr Thr Asp He Glu 


Asn Phe Pro Gly 










70 


75 


80 


Phe 


Pro 


Glu 


Ser 


Leu 


Ser 


Gly Ser Glu Leu Met Glu 


Arg Met Arg Lys 










85 




90 


95 


Gin 


Ser 


Ala 


Lys 


Phe Gly 


Thr Asn He He Thr Glu 


Thr Val Ser Lys 














105 


110 


Val 


Asp 


Leu 


Ser 


Ser 


Lys 


Pro Phe Arg Leu Trp Thr 


Glu Phe Asn Glu 




115 








120 


125 


Asp 


Ala 


Glu 


Pro 


Val 


Thr 


Thr Asp Ala He He Leu 


Ala Thr Gly Ala 


130 










135 140 




Ser 


Ala 


Lvs 


Ara 


Met 


His 


Leu Pro Gly Glu Glu Thr 


Tyr Trp Gin Gin 


145 










150 


155 


160 


Gly 


lie 


Ser 


Ala 


Cys 


Ala 


Val Cvs Asn Glv Ala Val 


Pro He Phe Arg 










165 




170 


175 


Asn 


Lys 


Pro 


Leu 


Ala 


Val 


He Gly Gly Gly Asp Ser 


Ala Cys Glu Glu 








180 






185 


190 


Ala 


Glu 


Phe 


Leu 


Thr 


Lys 


Tyr Ala Ser Lys Val Tyr 


He Leu Val Arg 






195 








200 


205 


Lys 


Asp 


His 


Phe 


Arg Ala 


Ser Val He Met Gin Arg 


Arg He Glu Lys 




210 










215 220 




Asn 


Pro 


Asn 


He 


He 


Val 


Leu Phe Asn Thr Val Ala 


Leu Glu Ala Lys 


225 










230 


235 


240 


Gly 


Asp 


Gly 


Lys 


Leu 


Leu 


Asn Met Leu Arg He Lys 


Asn Thr Lys Ser 










245 




250 


255 


Asn 


Val 


Glu 


Asn 


Asp 


Leu 


Glu Val Asn Gly Leu Phe 


Tyr Ala He Gly 








260 






265 


270 


His 


Ser 


Pro 


Ala 


Thr Asp 


lie Val Lys Gly Gin Val 


Asp Glu Glu Glu 






275 








280 


285 


Thr 


Gly 


Tyr 


He 


Lys 


Thr 


Val Pro Gly Ser Ser Leu 


Thr Ser Val Pro 




290 










295 300 




Gly 


Phe 


Phe 


Ala 


Ala Gly 


Asp Val Gin Asp Ser Arg 


Tyr Arg Gin Ala 


305 










310 


315 


320 


Val 


Thr 


Ser 


Ala 


Gly Ser 


Gly Cys He Ala Ala Leu 


Asp Ala Glu Arg 










325 




330 


335 


Tyr 


Leu 


Ser 


Ala 


Gin 


Glu 










340 











<210> 228 
<211> 499 
<212> PRT 
<213> Bos taurus 

<400> 228 
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Met 


Asn 


Gly 


Ser 


Lys 


Asp 


Leu 


Pro Glu 


Pro Tyr 


Asp Tyr Asp Leu He 


1 








5 








10 


15 


He 


He 


Gly 


Gly 


Gly 


Ser 


Gly 


Gly Leu 


Ala Ala 


Ala Lys Glu Ala Ala 








20 








25 




30 


Lys 


Tyr 


Asp 


Lys 


Lys 


Val 


Met 


Val Leu 


Asp Phe 


Val Thr Pro Thr Pro 




35 










40 




45 


Leu 


Gly 


Thr 


Arg 


Trp 


Gly 


Leu 


Gly Gly 


Thr Cys 


Val Asn Val Gly Cys 




50 










55 






60 


He 


Pro 


Lys 


Lys 


Leu 


Met 


His 


Gin Ala 


Ala Leu 


Leu Gly Gin Ala Leu 


65 






70 






75 


80 


Arg 


Asp 


Ser 


Arg 


Asn 


Tyr 


Gly 


Trp Asn 


Val Glu 


Glu Thr Val Lys His 










85 








90 


95 


Asp 


Trp 


Glu 


Arg 


Met 


Thr 


Glu 


Ala Val 


Gin Asn 


His He Gly Ser Leu 






100 








105 




110 


Asn 


Trp 


Gly 


Tyr 


Arg 


Val 


Ala 


Leu Arg 


Glu Lys 


Lys Val Thr Tyr Glu 






115 










120 




125 


Asn 


Ala 


Tyr 


Gly 


Glu 


Phe 


Val 


Gly Pro 


His Arg 


He Lys Ala Thr Asn 




130 










135 






140 


Asn 


Lys 


Gly 


Lys 


Glu 


Lys 


He 


Tyr Ser 


Ala Glu 


Arg Phe Leu He Ala 


145 










150 






155 


160 


Thr 


Gly 


Glu 


Arg 


Pro 


Arg 


Tyr 


Leu Gly 


He Pro 


Gly Asp Lys Glu Tyr 










165 








170 


175 


Cys 


He 


Ser 


Ser 


Asp 


Asp 


Leu 


Phe Ser 


Leu Pro 


Tyr' Cys Pro Gly Lys 








180 








185 




190 


Thr 


Leu 


Val 


Val 


Gly 


Ala 


Ser 


Tyr Val 


Ala Leu 


Glu Cys Ala Gly Phe 






195 








200 




205 


Leu 


Ala 


Gly 


He 


Gly 


Leu 


Asp 


Val Thr 


Val Met 


Val Arg Ser He Leu 




210 










215 






220 


Leu 


Arg 


Gly 


Phe 


Asp 


Gin 


Asp 


Met Ala 


Asn Lys 


He Gly Glu His Met 


225 










230 






235 


240 


Gin 


Glu 


His 


Gly 


He 


Lys 


Phe 


He Arg 


Gin Phe 


Val Pro He Lys Val 










245 








250 


255 


Glu 


Gin 


He 


Glu 


Ala 


Gly 


Thr 


Pro Gly 


Arg Leu 


Arg Val He Ala Lys' 








260 






265 




270 


Ser 


Thr 


Asp 


Ser 


Asp 


Gin 


Thr 


He Glu 


Gly Glu 


Tyr Asn Thr Val Leu 






275 








280 




285 


Leu 


Ala 


He 


Gly 


Arg 


Asp 


Ala 


Cys Thr 


Arg Lys 


He Gly Leu Glu Asn 




290 










295 






300 


Val 


Gly 


Val 


Lys 


He 


Asn 


Glu 


Lys Thr 


Gly Lys 


He Pro Val Thr Glu 


305 










310 






315 


320 


Glu 


Glu 


Gin 


Thr 


Asn 


Val 


Pro 


Tyr He 


Tyr Ala 


He Gly Asp He Leu 










325 






330 


335 


Glu 


Gly 


Lys 


Leu 


Glu 


Leu 


Thr 


Pro Val 


Ala He 


Gin Ala Gly Arg Leu 








340 








345 




350 


Leu 


Ala 


Gin 


Arg 


Leu 


Tyr 


Gly 


Gly Ser 


Thr Val 


Lys Cys Asp Tyr Glu 






355 










360 




365 


Asn 


Val 


Pro 


Thr 


Thr 


Val 


Phe 


Thr Pro 


Leu Glu 


Tyr Gly Ser Cys Gly 




370 










375 






380 


Leu 


Ser 


Glu 


Glu 


Lys 


Ala 


Val 


Glu Lys 


Phe Gly 


Glu Glu Asn Val Glu 


385 








390 




395 


400 


Val 


Tyr 


His 


Ser 


Tyr 


Phe 


Trp 


Pro Leu 


Glu Trp 


Thr He Pro Ser Arg 










405 








410 


415 


Asp 


Asn 


Asn 


Lys 


Cys 


Tyr 


Ala 


Lys Val 


Val Cys 


Asn He Lys Asp Asn 








420 








425 




430 


Glu 


Arg 


Val 


Val 


Gly 


Phe 


His 


Val Leu 


Gly Pro 


Asn Ala Gly Glu Val 






435 










440 




445 


Thr 


Gin 


Gly 


Phe 


Ala 


Ala 


Ala 


Leu Lys 


Cys Gly 


Leu Thr Lys Asp Gin 




450 










455 






460 


Leu 


Asp 


Ser 


Thr 


lie 


Gly 


He 


His Pro 


Val Cys 


Ala Glu Val Phe Thr 


465 










470 






475 


480 


Thr 


Leu 


Ser 


Val 


Thr 


Lys 


Arg 


Ser Gly 


Gly Asn 


He Leu Gin Thr Gly 










485 








490 


495 


Cys 


Cys 


Gly 

















<210> 229 
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<211> 523 
<212> PRT 

<213> Caenorhabditis elegans 



<400> 229 



Met Tyr 


He 


Lys 


Gly Asn 


Ala 


Val Gly Gly Leu Lys Glu Leu 


Lys 


Ala 


1 






5 




10 


15 




Leu Lys 


Gin 


Asp 


Tyr Leu 


Lys 


Glu Trp Leu Arg Asp His Thr 


Tyr Asp 






20 






25 30 






Leu lie 


Val 


He 


Gly Gly 


Gly 


Ser Gly Gly Leu Ala Ala Ala 


Lys 


Glu 




35 








40 45 






Ala Ser 


Arg 


Leu 


Gly Lys 


Lys 


Val Ala Cys Leu Asp Phe Val 


Lys 


Pro 


50 








55 


60 






Ser Pro 


Gin 


Gly 


Thr Ser 


Trp 


Gly Leu Gly Gly Thr Cys Val 


Asn 


Val 


65 






70 




75 




80 


Gly Cys 


He 


Pro 


Lys Lys 


Leu 


Met His Gin Ala Ser Leu Leu 


Gly His 








85 




90 


95 




Ser lie 


His 


Asp 


Ala Lys 


Lys 


Tyr Gly Trp Lys Leu Pro Glu 


Gly Lys 






100 






105 110 






Val Glu 


His 


Gin 


Trp Asn 


His 


Leu Arg Asp Ser Val Gin Asp 


TT-5 a . 


lie 




115 






120 125 






Ala Ser 


Leu 


Asn 


Trp Gly 


Tyr 


Arg Val Gin Leu Arg Glu Lys 


mr 


vai 


130 








135 


140 






Thr Tyr 


He 


Asn 


Ser Tyr 


Gly 


Glu Phe Thr Gly Pro Phe Glu 


lxe 


Ser 


145 






150 




155 »'/ 




JLOU 


Ala Thr 


Asn 


Lvs 


Lvs Lvs 


Lvs 


Val Glu Lys Leu Thr" Ala Asp 


Arg 


irne 








165 




170 


JL / D 




Leu lie 


Ser 


Thr 


Gly Leu 


Arq 


Pro Lys Tyr Pro Glu He Pro 


Ciiy 


vai 






180 






185 190 






Lys Glu 


Tvr 


Thr 


He Thr 


Ser 


Asp Asp Leu Phe Gin Leu Pro 


Tyr 


Ser 




195 








200 205 






Pro Gly 


Lys 


Thr 


Leu Cvs 


Val 


Gly Ala Ser Tyr Val Ser Leu 


vjJIU 


Cys 


210 








215 


220 






Ala Gly 


Phe 


Leu 


His Gly 


Phe 


Gly Phe Asp Val Thr Val Met 


vax 


Arg 


225 






230 




235 




240 


Ser lie 


Leu 


Leu 


Ara Glv 


Phe 


Asp Gin Asp Met Ala Glu Arg 


± ±G 


Arg 








245 




250 


TCC 
£t 3 Zj 




Lys His 


Met 


He 


Ala Tyr 


Glv 


Met Lys Phe Glu Ala Gly Val 


Pro 


IXlX 






260 






265 270 






Arg lie 


Glu 


Gin 


He Asp 


Glu 


Lys Thr Asp Glu Lys Ala Gly 


Lys 


Tyr 




275 








280 285 






Arg Val 


Phe 


Trp 


Pro Lys 


Lys 


Asn Glu Glu Thr Gly Glu Met 


r*-\ y. 


uIU 


290 








295 


300 






Val Ser 


Glu 


Glu 


Tyr Asn 


Thr 


He Leu Met Ala He Gly Arg 


Glu 


Ala 


305 






310 




315 




320 


Val Thr 


Asp 


Asp 


Val Gly 


Leu 


Thr Thr He Gly Val Glu Arg 


Ala 


Lys 








325 




330 


335 




Ser Lys 


Lys 


Val 


Leu Gly 


Arg 


Arg Glu Gin Ser Thr Thr He 


Pro 


Trp 






340 






345 350 






Val Tyr 


Ala 


He 


Gly Asp 


Val 


Leu Glu Gly Thr Pro Glu Leu 


Thr 


Pro 


355 






360 365 






Val Ala 


He 


Gin 


Ala Gly 


Ara 


Val Leu Met Arg Arg He Phe 


Asp 


Gly 


370 








375 


380 






Ala Asn 


Glu 


Leu 


Thr Glu 


Tyr 


Asp Gin He Pro Thr Thr Val 


Phe 


Thr 


385 






390 


395 




400 


Pro Leu 


Glu 


Tyr 


Gly Cys 


Cys 


Gly Leu Ser Glu Glu Asp Ala' 


Met 


Met 








405 




410 


415 




Lys Tyr 


Gly 


Lys 


Asp Asn 


He 


He He Tyr His Asn Val Phe 


Asn 


Pro 






420 






425 430 






Leu Glu 


Tyr 


Thr 


He Ser 


Glu 


Arg Met Asp Lys Asp His Cys 


Tyr 


Leu 




435 








440 ~ . 445 






Lys Met 


He 


Cys 


Leu Arg 


Asn 


Glu Glu Glu Lys Val Val Gly 


Phe 


His 


450 








455 


460 






lie Leu 


Thr 


Pro 


Asn Ala 


Gly 


Glu Val Thr Gin Gly Phe Gly 


He 


Ala 


465 






470 




475 




480 


Leu Lys 


Leu 


Ala 


Ala Lys 


Lys 


Ala Asp Phe Asp Arg Leu He 


Gly 


He 








485 




490 


495 
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His Pro Thr Val Ala Glu Asn Phe 
500 

Glu Gly Asp Glu Glu Leu Gin Ala 
515 520 



Thr Thr Leu Thr Leu Glu Lys Lys 
505 510 
Ser Gly Cys 



<210> 230 
<211> 497 
<212> PRT 

<213> Homo sapiens 
<400> 230 

Met Asn Gly Pro Glu Asp Leu Pro Lys Ser Tyr Asp Tyr Asp Leu lie 

1 5 10 ' 15 

lie lie Gly Gly Gly Ser Gly Gly Leu Ala Ala Ala Lys Glu Ala Ala 

20 25 30 

Gin Tyr Gly Lys Lys Val Met Val Leu Asp Phe Val Thr Pro Thr Pro 

35 40 45 

Leu Gly Thr Arg Trp Gly Leu Gly Gly Thr Cys Val Asn Val Gly Cys 

50 55 60 

lie Pro Lys Lys Leu Met His Gin Ala Ala Leu Leu Gly Gin Ala Leu 
65 70 75 ~ 80 

Gin Asp Ser Arg Asn Tyr Gly Trp Lys Val Glu Glu Thr Val Lys His 

85 90 95 

Asp Trp Asp Arg Met lie Glu Ala Val Gin Asn His lie Gly Ser Leu 

100 105 110 

Asn Trp Gly Tyr Arg Val Ala Leu Arg Glu Lys Lys Val Val Tyr Glu 

115 120 125 

Asn Ala Tyr Gly Gin Phe lie Gly Pro His Arg lie Lys Ala Thr Asn 

130 135 140 

Asn Lys Gly Lys Glu Lys lie Tyr Ser Ala Glu Ser Phe Leu lie Ala 
145 150 155 160 

Thr Gly Glu Arg Pro Arg Tyr Leu Gly lie Pro Gly Asp Lys Glu Tyr 

165 170 175 

Cys lie Ser Ser Asp Asp Leu Phe Ser Leu Pro Tyr Cys Pro Gly Lys 

180 185 ~ 190 

Thr Leu Val Val Gly Ala Ser Tyr Val Ala Leu Glu Cys Ala Gly Phe 

195 200 205 

Leu Ala Gly lie Gly Leu Gly Val Thr Val Met Val Arg Ser He Leu 

210 215 220 

Leu Arg Gly Phe Asp Gin Asp Met Ala Asn Lys He Gly Glu His Met 
225 230 235 240 

Glu Glu His Gly He Lys Phe He Arg Gin Phe Val Pro He Lys Val 

245 250 255 

Glu Gin He Glu Ala Gly Thr Pro Gly Arg Leu Arg Val Val Ala Gin 

260 265 270 

Ser Thr Asn Ser Glu Glu He He Glu Gly Glu Tyr Asn Thr Val Met 

275 280 285 

Leu Ala He Gly Arg Asp Ala Cys Thr Arg Lys He Gly Leu Glu Thr 

290 295 300 

Val Gly Val Lys He Asn Glu Lys Thr Gly Lys He Pro Val Thr Asp 
.305 310 315 320 

Glu Glu Gin Thr Asn Val Pro Tyr He Tyr Ala He Gly Asp He Leu 

325 330 335 

Glu Asp Lys Val Glu Leu Thr Pro Val Ala He Gin Ala Gly Arg Leu 

340 345 350' 

Leu Ala Gin Arg Leu Tyr Ala Gly Ser Thr Val Lys Cys Asp Tyr Glu 

355 360 365 

Asn Val Pro Thr Thr Val Phe Thr Pro Leu Glu Tyr Gly Ala Cys Gly 

370 375 380 

Leu Ser Glu Glu Lys Ala Val Glu Lys Phe Gly Glu Glu Asn He Glu 
385 390 395 400 

Val Tyr His Ser Tyr Phe Trp Pro Leu Glu Trp Thr He Pro Ser Arg 

405 410 * 415 

Asp Asn Asn Lys Cys Tyr Ala Lys He He Cys Asn Thr Lys Asp Asn 

420 425 430 

Glu Arg Val Val Gly Phe His Val Leu Gly Pro Asn Ala Gly Glu Val 
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435 440 
Thr Gin Gly Phe Ala Ala Ala Leu 

450 455 
Leu Asp Ser Thr lie Gly lie His 
465 470 
Thr Leu Ser Val Thr Lys Arg Ser 
485 

Cys 



445 

Lys Cys Gly Leu Thr Lys Lys Gin 
460 

Pro Val Cys Ala Glu Val Phe Thr 
475 480 
Gly Ala Ser lie Leu Gin Ala Gly 
490 495 



<210> 231 
<211> 541 
<212> PRT 

<213> Plasmodium falciparum 
<400> 231 

Met Cys Lys Asp Lys Asn Glu Lys Lys Asn Tyr Glu His Val Asn Ala 

1 .5 10 15 

Asn Glu Lys Asn Gly Tyr Leu Ala Ser Glu Lys Asn Glu Leu Thr Lys 

20 25 30 

Asn Lys Val Glu Glu His Thr Tyr Asp Tyr Asp Tyr Val Val He Gly 

35 40 45 

Gly Gly Pro Gly Gly Met Ala Ser Ala Lys Glu Ala Ala Ala His Gly 

50 55 60 

Ala Arg Val Leu Leu Phe Asp Tyr Val Lys Pro Ser Ser Gin Gly Thr 
65 70 75 80 

Lys Trp Gly He Gly Gly Thr Cys Val Asn Val Gly Cys Val Pro Lys 

85 90 ^ 95 

Lys Leu Met His Tyr Ala Gly His Met Gly Ser He Phe Lys Leu Asp 

100 105 ^ 110 

Ser Lys Ala Tyr Gly Trp Lys Phe Asp Asn Leu Lys His Asp Trp Lys 

115 120 i25 

Lys Leu Val Thr Thr Val Gin Ser His He Arg Ser Leu Asn Phe Ser 

130 135 140 

Tyr Met Thr Gly Leu Arg Ser Ser Lys Val Lys Tyr He Asn Gly Leu 
145 150 155 160 

Ala Lys Leu Lys Asp Lys Asn Thr Val Ser Tyr Tyr Leu Lys Gly Asp 

165 170 175 

Leu Ser Lys Glu Glu Thr Val Thr Gly Lys Tyr He Leu He Ala Thr 

180 185 * 190 

Gly Cys Arg Pro His He Pro Asp Asp Val Glu Gly Ala Lys Glu Leu 

195 200 205 

Ser He Thr Ser Asp Asp He Phe Ser Leu Lys Lys Asp Pro Gly Lys 

210 215 220 

Thr Leu Val Val Gly Ala Ser Tyr Val Ala Leu Glu Cys Ser Gly Phe 
225 230 235 240 

Leu Asn Ser Leu Gly Tyr Asp Val Thr Val Ala Val Arg Ser He Val 

245 250 255 

Leu Arg Gly Phe Asp Gin Gin Cys Ala Val Lys Val Lys Leu Tyr Met 

260 265 270 

Glu Glu Gin Gly Val Met Phe Lys Asn Gly He Leu Pro Lys Lys Leu 

275 280 285 

Thr Lys Met Asp Asp Lys He Leu Val Glu Phe Ser Asp Lys Thr Ser 

290 295 300 

Glu Leu Tyr Asp Thr Val Leu Tyr Ala He Gly Arg Lys Gly Asp He 
305 310 315 ~ 320 

Asp Gly Leu Asn Leu Glu Ser Leu Asn Met Asn Val Asn Lys Ser Asn 

325 330 335 

Asn Lys He He Ala Asp His Leu Ser Cys Thr Asn He Pro Ser He 

340 345 350 

Phe Ala Val Gly Asp Val Ala Glu Asn Val Pro Glu Leu Ala Pro Val 

355 360 365 

Ala- He Lys Ala Gly Glu He Leu Ala Arg Arg Leu Phe Lys Asp Ser 

370 " 375 "* 380 

Asp Glu He Met Asp Tyr Ser Tyr He Pro Thr Ser He Tyr Thr Pro 
385 390 395 400 
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lie 


Glu 


Tyr 


Gly Ala Cys Gly Tyr 








405 


Tyr 


Gly 


Lys 


Ser Asn Val Glu Val 








420 


Glu 


lie 


Ser 


Ala Val His Arg Gin 






435 


440 


Glu 


Tyr 


Asp 


Leu Asp Val Ser Ser 




450 




455 


Leu 


Lys 


Asn 


Glu Asp Asn Arg Val 


465 






470 


Asn 


Ala 


Gly 


Glu Val Thr Gin Gly 








485 


Val 


Lys 


Lys 


Lys Asp Phe Asp Asn 








500 


Ala 


Glu 


Ser 


Phe Met Asn Leu Phe 






515 


520 


Tyr 


Ala 


Ala 


Lys Gly Gly Cys Gly 




530 




535 



Ser Glu Glu Lys Ala Tyr Glu Leu 

410 415 
Phe Leu Gin Glu Phe Asn Asn Leu 
425 430 
Lys His He Arg Ala Gin Lys Asp 
445 

Thr Cys Leu Ala Lys Leu Val Cys 
460 

He Gly Phe His Tyr Val Gly Pro 
475 480 
Met Ala Leu Ala Leu Arg Leu Lys 

490 495 
Cys He Gly He His Pro Thr Asp 
505 510 
Val Thr He Ser Ser Gly Leu Ser 
525 

Gly Gly Lys Cys Gly 
540 



<210> 232 
<211> 535 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 232 

Met Ala Ala Ser Pro Lys He Gly He Gly He Ala Ser Val Ser Ser 

1 5 10 15 

Pro His Arg Val Ser Ala Ala Ser Ser Ala Leu Ser Pro Pro Pro His 

20 25 30 

Leu Phe Phe Leu Thr Thr Thr Thr Thr Thr Arg His Gly Gly Ser Tyr 

35 40 45 

Leu Leu Arg Gin Pro Thr Arg Thr Arg Ser Ser Asp Ser Leu Arg Leu 

50 55 60 

Arg Val Ser Ala Thr Ala Asn Ser Pro Ser Ser Ser Ser Ser Gly Gly 
65 70 75 80 

Glu He He Glu Asn Val Val He He Gly Ser Gly Pro Ala Gly Tyr 

85 90 95 

Thr Ala Ala He Tyr Ala Ala Arg Ala Asn Leu Lys Pro Val Val Phe 

100 105 110 

Glu Gly Tyr Gin Met Gly Gly Val Pro Gly Gly Gin Leu Met Thr Thr 

115 120 125 

Thr Glu Val Glu Asn Phe Pro Gly Phe Pro Asp Gly He Thr Gly Pro 

130 135 140 

Asp Leu Met Glu Lys Met Arg Lys Gin Ala Glu Arg Trp Gly Ala Glu 
145 150 155 160 

Leu Tyr Pro Glu Asp Val Glu Ser Leu Ser Val Thr Thr Ala Pro Phe 

165 170 175 

Thr Val Gin Thr Ser Glu Arg Lys Val Lys Cys His Ser He He Tyr 

180 185 190 

Ala Thr Gly Ala Thr Ala Arg Arg Leu Arg Leu Pro Arg Glu Glu Glu 

195 200 205 

Phe Trp Ser Arg Gly He Ser Ala Cys Ala He Cys Asp Gly Ala Ser 

210 215 220 

Pro Leu Phe Lys Gly Gin Val Leu Ala Val Val Gly Gly Gly Asp Thr 
225 230 235 240 

Ala Thr Glu Glu Ala Leu Tyr Leu Thr Lys Tyr Ala Arg His Val His 

245 250 255 ' 

Leu Leu Val Arg Arg Asp Gin Leu Arg Ala Ser Lys Ala Met Gin Asp 

260 265 270 

Arg Val He Asn Asn Pro Asn He Thr Val His Tyr Asn Thr Glu Thr 

275 280 285 

Val Asp Val Leu Ser Asn Thr Lys Gly Gin Met Ser Gly He Leu Leu 

290 295 300 

Arg Arg Leu Asp Thr Gly Glu Glu Thr Glu Leu Glu Ala Lys Gly Leu 
305 310 315 320 

Phe Tyr Gly He Gly His Ser Pro Asn Ser Gin Leu Leu Glu Gly Gin 
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325 330 335 

Val Glu Leu Asp Ser Ser Gly Tyr Val Leu Val Arg Glu Gly Thr Ser 

340 345 350 

Asn Thr Ser Val Glu Gly Val Phe Ala Ala Gly Asp Val Gin Asp His 

355 360 365 

Glu Trp Arg Gin Ala Val Thr Ala Ala Gly Ser Gly Cys He Ala Ala 

370 375 380 

Leu Ser Ala Glu Arg Tyr Leu Thr Ser Asn Asn Leu Leu Val Glu Phe 
385 390 395 400 

His Gin Pro Gin Thr Glu Glu Ala Lys Lys Glu Phe Thr Gin Arg Asp 

405 410 415 

Val Gin Glu Lys Phe Asp He Thr Leu Thr Lys His Lys Gly Gin Tyr 

420 *" 425 430 

Ala Leu Arg Lys Leu Tyr His Glu Ser Pro Arg Val He Leu Val Leu 

435 440 445 

Tyr Thr Ser Pro Thr Cys Gly Pro Cys Arg Thr Leu Lys Pro He Leu 

450 455 460 

Asn Lys Val Val Asp Glu Tyr Asn His Asp Val His Phe Val Glu He 
465 470 475 480 

Asp He Glu Glu. Asp Gin Glu He Ala Glu Ala Ala Gly He Met Gly 

485 490 495 

Thr Pro Cys Val Gin Phe Phe Lys Asn Lys Glu Met Leu Arg Leu Gly 

500 505 510 

Asn Val Leu Ser Val Leu Lys Leu His Arg Leu Leu Cys Ser Gly. Leu 

515 520 525 

Ala Lys Asp Ser Glu Ser Val 
530 535 



<210> 233 
<211> 117 
<212> PRT 

<213> Helianthus annuus 
<400> 233 

Ala Val Val Glu Ala Tyr Gly Glu Glu Gly Lys Asn Val Leu Gly Gly 

15 10 15 

Leu Lys Val Lys Asn Val Val Ser Gly Glu Val Ser Asp Leu Lys Val 

20 25 30 

Asn Gly Leu Phe Phe Ala He Gly His Glu Pro Ala Thr Lys Phe Leu 

35 40 45 

Asp Gly Gin Leu Glu Leu Asp Ser Asp Gly Tyr Val Val Thr Lys Pro 

50 55 60 

Gly Thr Thr He Ser Ser Val Lys Gly Val Phe Ala Ala Gly Asp Val 
65 70 75 * 80 

Gin Asp Lys Lys Tyr Arg Gin Ala Val Thr Ala Ala Gly Ser Gly Cys 

85 90 95 

Met Ala Ala Leu Asp Ala Glu His Tyr Leu Gin Glu He Gly Ser Gin 

100 105 110 

Glu Gly Lys Ser Asp 
115 



<210> 234 
<2U> 300 
<212> PRT 

<213> Arcaeoglobus fulgidus 
<400> 234 

Met Tyr Asp Val Ala He He Gly 

1 5 
Ala Leu Tyr Ser Ala Arg Tyr Gly 
20 

Val Asp Pro Val Ser Gin Leu Ser 

35 40 
Pro Gly Phe Glu Gly Ser Gly Met 
50 55 



Gly Gly Pro Ala Gly Leu Thr Ala 

10 ~ 15 

Leu Lys Thr Val Phe Phe Glu Thr 
25 30 
Leu Ala Ala Lys He Glu Asn Tyr 
45 

Glu Leu Leu Glu Lys Met Lys Glu 
60 
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Gin Ala Val Lys Ala Gly Ala Glu Trp Lys Leu Glu Lys Val Glu Arg 
65 70 " 75 80 

Val Glu Arg Asn Gly Glu Thr Phe Thr Val lie Ala Glu Gly Gly Glu 

85 90 95 

Tyr Glu Ala Lys Ala lie lie Val Ala Thr Gly Gly Lys His Lys Glu 

100 105 " 110 

Ala Gly lie Glu Gly Glu Ser Ala Phe He Gly Arg Gly Val Ser Tyr 

115 120 125 

Cys Ala Thr Cys Asp Gly Asn Phe Phe Arg Gly Lys Lys Val He Val 

130 135 140 

Tyr Gly Ser Gly Lys Glu Ala He Glu Asp Ala He Tyr Leu His Asp 
145 150 155 160 

He Gly Cys Glu Val Thr He Val Ser Arg Thr Pro Ser Phe Arg Ala 

165 170 175 

Glu Lys Ala Leu Val Glu Glu Val Glu Lys Arg Gly He Pro Val His 

180 185 190 

Tyr Ser Thr thr He Arg Lys He He Gly Ser Gly Lys Val Glu Lys 

195 " 200 205 

Val Val Ala Tyr Asn Arg Glu Lys Lys Glu Glu Phe Glu He Glu Ala 

210 215 220 

Asp Gly He Phe Val Ala He Gly Met Arg Pro Ala Thr Asp Val Val 
225 230 235 240 

Ala Glu Leu Gly Val Glu Arg Asp Ser Met Gly Tyr He Lys Val Asp 

245 250 255 

Lys Glu Gin Arg Thr Asn Val Glu Gly Val Phe Ala Ala Gly Asp Cys 

260 265 270 

Cys Asp Asn Pro Leu Lys Gin Val Val Thr Ala Cys Gly Asp Gly Ala 

275 280 285 

Val Ala Ala Tyr Ser Ala Tyr Lys Tyr Leu Thr Ser 
290 295 * 300 



<210> 235 
<211> 315 
<212> PRT 

<213> Bacillus halodurans 



<400> 235 



Met 


Gly 


Glu 


Glu 


Gin 


Lys 


Val Tyr 


1 








•5 






Pro 


Ala 


Gly 


Met 


Thr 


Ala 


Ala Val 








20 








Thr 


Val 


Met 


Val 


Glu 


Arg 


Gly Val 






35 








40 


Glu 


Asp 


Val 


Glu 


Asn 


Tyr 


Pro Gly 




50 










55 


Leu 


Ser 


Thr 


Lys 


Met 


Phe 


Glu His 


65 










70 




Ala 


Tyr 


Gly 


Asp 


He 


Lys 


Glu He 










85 






Val 


Lys 


Ala 


Gly 


Asn 


Lys 


Glu Tyr 








100 








Thr 


Gly 


Ala 


Glu 


Tyr 


Lys 


Lys Leu 






115 








120 


Ser 


Gly 


Arg 


Gly 


Val 


Ser 


Tyr Cys 




130 










135 


Lys 


Gly 


Lys 


Glu 


Leu 


Val 


Val Val 


145 










150 




Glu 


Ala 


Val 


Tyr 


Leu 


Thr 


Arg Phe 










165 




Arg 


Arg 


Asp 


Gin 


Leu 


Arg 


Ala Gin 








180 








Asp 


Asn 


Asp 


Lys 


He 


Glu 


Phe He 






195 








200 


Asn 


Gly 


Thr 


Asp 


Gly 


Lys 


Val Ser 




210 










215 


Thr 


Gly 


Glu 


Gin 


Gin 


Asp 


Phe Lys 



Asp Val Val He Ala Gly Ala Gly 

10 15 
Tyr Thr Ser Arg Ala Asn Leu Ser 
25 30 
Pro Gly Gly Gin Met Ala Asn Thr 
45 

Phe Asp His He Leu Gly Pro Glu 
60 

Ala Lys Lys Phe Gly Ala Glu Tyr 

75 80 
He Asp Gin Gly Asp Leu Lys Leu 

90 ^ 95 

Lys Ala Arg Ala Val He Val Ala 
105 110 
Gly Val Pro Gly Glu Lys Glu Leu 
125 

Ala Val Cys Asp Gly Ala Phe Phe 
140 

Gly Gly Gly Asp Ser Ala Val Glu 
155 160 
Ala Ser Lys Val Thr He He His 

170 175 
Lys He Leu Gin Gin Arg Ala Phe 
185 190 
Trp Asp His Val Val Lys Gin He 
205 

Ser Val Thr He Glu His Ala Lys 
220 

Thr Asp Gly Val Phe He Tyr He 



-138- 



WO 02/50289 



PCT/US01/50240 



225 



230 



235 



240 



Gly Met Leu Pro Leu Asn Glu Ala Val Lys Asn Leu Asn lie Leu Asn 

245 250 255 

Asp Glu Gly Tyr lie Val Thr Asn Glu Glu Met Glu Thr Ser Val Pro 

260 265 270 

Gly lie Phe Ala Ala Gly Asp Val Arg Glu Lys Ser Leu Arg Gin lie 

275 280 " 285 

Val Thr Ma Thr Gly Asp Gly Ser Leu Ala Ala Gin Asn Val Gin His 

290 295 300 

Tyr lie Glu Glu Leu Ala Glu Lys Val Lys Asn 
305 310 315 



<210> 236 
<211> 330 
<212> PRT 

<213> Bacillus halodurans 
<400> 236 

Met Ser Arg Lys Glu Glu Leu Tyr Asp lie Thr lie lie Gly Gly Gly 

1 5 10 15 

Pro Thr Gly Leu Phe Ala Ala Phe Tyr Gly Gly Met Arg Gin Ala Lys 

20 25 30 

Val Lys lie lie Glu Ser Met Pro Gin Leu Gly Gly Gin Leu Ala Ala 

35 40 " 45 

Leu Tyr Pro Glu Lys Tyr lie Tyr Asp Val Ala Gly Phe Pro Lys Val 

50 55 60 

Lys Ala Gin Asp Leu Val Asn Asp Leu Lys Arg Gin Ala Glu Gin Phe 
65 70 75 80 

Asn Pro Thr lie Ala Leu Glu Gin Ser Val Gin Asn Val Thr Lys Glu 

85 90 95 

Thr Asp Asp Thr Phe Thr He Lys Thr Asp Lys Glu Thr His Tyr Ser 

100 105 110 

Lys Ala He He He Thr Ala Gly Ala Gly Ala Phe -Gin Pro Arg Arg 

115 120 125 

Leu Glu Val Glu Gly Ala Lys Gin Tyr Glu Gly Lys Asn Leu Gin Tyr 

130 135 * 140 

Phe Val Asn Asp Leu Asn Ala Tyr Ala Gly Lys Asn Val Leu He Ser 
145 150 " 155 160 

Gly Gly Gly Asp Ser Ala Val Asp Trp Ala Leu Met Leu Glu Pro Val 

165 170 175 

Ala Lys Asn Val Thr Leu He His Arg Arg Asp Lys Phe Arg Ala His 

180 185 " 190 

Glu His Ser Val Glu Leu Leu Gin Lys Ser Ser Val Asn He Leu Thr 

195 200 205 

Pro Phe Ala He Ser Glu Leu Ser Gly Asp Gly Glu Lys He His His 

210 215 220 

Val Thr He Gin Glu Val Lys Gly Asp Ala Val Glu Thr Leu Asp Val 
225 230 235 240 

Asp Glu Val He Val Asn Phe Gly Phe Val Ser Ser Leu Gly Pro He 

245 250 255 

Lys Gly Trp Gly Leu Glu He Glu Lys Asn Ser He Val Val Asn Thr 

260 265 270 

Lys Met Glu Thr Asn He Pro Gly He Tyr Ala Ala Gly Asp He Cys 

275 280 285 

Thr Tyr Pro Gly Lys Val Lys Leu He Ala Thr Gly Phe Gly Glu Ala 

290 295 300 

Pro Thr Ala Val Asn Asn Ala Lys Ala Phe He Asp Pro Thr Ala Arg 
305 310 315 320 

Val Phe Pro Gly His Ser Thr Ser Leu Phe 



325 



330 



<210> 237 
<211> 213 
<212> PRT 



<213> Bacillus halodurans 
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<400> 237 












Met 


Thr 


Asn 


Leu 


His 


Tyr 


Thr Val Lys Ser Leu Met Arg Phe 


Lys Asp 


1 








5 




10 


15 


Lys 


Thr 


Val 


He 


lie 


Ser 


Gly Gly Gly Asn Ser Ala He Asp 


Trp Ala 








20 






25 30 




Asn 


Glu 


Leu 


Glu 


Pro 


He 


Ala Lys Lys Val Tyr Leu Thr Tyr 


Arg Lys 






35 








40 45 




Glu 


Ala 


Leu 


Asn 


Gly 


His 


Glu Ala Gin He Ser Gin Leu Leu 


Ser Ser 




50 








55 60 




Ser 


Ala 


Thr 


Cys 


Leu 


Phe 


His Thr Thr He Ser Lys Leu He 


Ala Arg 


65 










70 


75 


80 


Asp 


Asn 


Lys 


Glu 


Val 


He 


Glu Gin Val Glu Leu Thr Asp His 


Gin Thr 










85 




90 


95 


Gly 


Glu 


Val 


Thr 


Asn 


Leu 


Ala Val Asp Glu Val He He Asn 


His Gly 








100 






105 110 


Tyr 


Glu 


Arg 


Asp 


Lys 


Ser 


Leu Leu Asp Gin Ser Glu Val Thr 


Leu Asp 






115 








120 125 




Arg 


He 


Asp 


Asp 


Tyr 


Tyr 


He Ala Gly Thr Pro Thr Ser Ala 


Thr Ser 




130 










135 140 




Val 


Gly 


Gly 


He 


Tyr 


Ala 


Ala Gly Asp Val Leu Lys His Glu 


Gly Lys 


145 










150 


155 


160 


Leu 


His 


Leu 


He 


Ala 


Gly 


Ala Phe Gin Asp Ala Ala Asn Ala 


Val Asn 










165 


170 


175 


Gin 


Ala 


Lys 


Gin 


Trp 


lie 


Glu Pro Glu Ala His Gin Ser Ala 


Met. Val 








180 






185 190 




Ser 


Ser 


His 


Asn 


His 


Val 


Phe Lys Glu Arg Asn Arg Glu Leu 


He Arg 






195 








200 205 




Gin 


Met 


Leu 


Lys 


Asn 










210 















<210> 238 
<211> 136 
<212> PRT 

<213> Bacillus halodurans 
<400> 238 

Met Asn Trp Glu Glu Leu Tyr Asp Val Thr He He Gly Gly Gly Pro 

1 5 10 15 

Ala Gly Leu Phe Ser Ala Phe Tyr Ser Gly Leu Arg Glu Met Lys Thr 

20 * 25 30 

Lys Val He Glu Tyr Gin Pro Met Leu Gly Gly Lys Val His Val Tyr 

35 40 45 

Pro Glu Lys Met He Trp Asp Val Gly Gly Leu Thr Pro He Leu Gly 

50 55 60 

Glu Lys Leu He Glu Gin Leu Val Thr Gin Ala Leu Thr Phe Asn Pro 
65 70 75 80 

Thr Val Val Leu Asn Glu Lys Val Thr Ser He Ala Gin Glu Glu Ser 

85 90 95 

Gly Trp Phe Val He Arg Thr Ala Ser Gly Arg Ala His Leu Thr Lys 

100 105 110 

Thr Val He He Ala Val Gly Gly Gly He Leu Lys Pro Gin Lys Asn 

115 120 125 

Arg Ala Arg Arg Gly Arg Thr He 
130 135 



<210> 239 
<211> 312 
<212> PRT 

<213> Campylobacter jejuni 
<400> 239 

Met Leu Asp Val Ala He He Gly Gly Gly Pro Ala Gly Leu Ser Ala 

1 5 10 15 

Gly Leu Tyr Ala Thr Arg Gly Gly Leu Lys Asn Val Val Met Phe Glu 
20 25 30 
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Lys Gly Met Pro Gly Gly Gin lie Thr Ser Ser Ser Glu lie Glu Asn 

35 40 45 

Tyr Pro Gly Val Ala Gin Val Met Asp Gly lie Ser Phe Met Ala Pro 

50 55 60 

Trp Ser Glu Gin Cys Met Arg Phe Gly Leu Lys His Glu Met Yal Gly 
65 70 75 80 

Val Glu Gin He Leu Lys Asn Ser Asp Gly Ser Phe Thr He Lys Leu 

85 ~ 90 95 

Glu Gly Gly Lys Thr Glu Leu Ala Lys Ala Val He Val Cys Thr Gly 

100 105 110 

Ser Ala Pro Lys Lys Ala Gly Phe Lys Gly Glu Asp Glu Phe Phe Gly 

115 120 125 

Lys Gly Val Ser Thr Cys Ala Thr Cys Asp Gly Phe Phe Tyr Lys Asn 

130 135 140 

Lys Glu Val Ala Val Leu Gly Gly Gly Asp Thr Ala Leu Glu Glu Ala 
145 150 155 160 

Leu Tyr Leu Ala Asn He Cys Ser Lys He Tyr Leu He His Arg Arg 

165 170 " 175 

Asp Glu Phe Arg Ala Ala Pro Ser Thr Val Glu Lys Val Lys Lys Asn 

180 185 190 

Glu Lys He Glu Leu He Thr Ser Ala Ser Val Asp Glu Val Tyr Gly 

195 200 205 

Asp Lys Met Gly Val Ala <?ly Val Lys Val Lys Leu Lys Asp Gly Ser 

210 215 220 

He Arg Asp Leu Asn Val Pro Gly He Phe Thr Phe Val Gly Leu Asn 
225 230 ~ 235 240 

Val Arg Asn Glu He Leu Lys Gin Asp Asp Ser Lys Phe Leu Cys Asn 

245 " 250 255 

Met Glu Glu Gly Gly Gin Val Ser Val Asp Leu Lys Met Gin Thr Ser 

'260 265 270 

Val Ala Gly Leu Phe Ala Ala Gly Asp Leu Arg Lys Asp Ala Pro Lys 

275 280 285 

Gin Val He Cys Ala Ala Gly Asp Gly Ala Val Ala Ala Leu Ser Ala 

290 295 300 

Met Ala Tyr He Glu Ser Leu His 
305 310 



<210> 240 
<211> 348 
<212>- PRT 

<213> Caulobacter crescentus 



<400> 240 



Met 


Ser 


Pro 


Leu 


Arg 


Arg 


He 


His Thr He Ser Pro Pro Met Ser Thr 


1 








5 




10 15 


Leu 


Ser 


Pro 


Arg 


Gin 


Thr 


Arg 


Cys Leu He He Gly Ser Gly Pro Ala 








20 








25 30 


Gly 


Tyr 


Thr 


Ala 


Ala 


He 


Tyr 


Ala Ala Arg Ala Leu Leu Lys Pro Val 






35 










40 " 45 


Leu 


He 


Ala 


Gly 


He 


Gin 


Pro 


Gly Gly Gin Leu Thr He Thr Thr Asp 




50 










55 


60 


Val 


Glu 


Asn 


Tyr 


Pro 


Gly 


Phe 


Ala Asp Val He Gin Gly Pro Trp Leu 


65 










70 




75 80 


Met 


Asp 


Gin 


Met 


Arg 


Ala 


Gin 


Ala Glu His Val Gly Thr Glu Phe Val 










85 






90 <95 


Ser 


Asp 


He 


Val 


Thr 


Ser 


Val 


Asp Leu Ser Lys Arg Pro Phe Thr Val 








100 








105 110 


Lys 


Thr 


Asp 


Ser 


Gly 


Gin 


Asp 


Trp He Ala Glu Thr He He He Ala 






115 










120 125 


Thr 


Gly 


Ala 


Gin 


Ala 


Lys 


Trp 


Leu Gly Leu Glu Ser Glu Ala Lys Phe 




130 










135 


140 


Gin 


Gly 


Phe 


Gly 


Val 


Ser 


Ala 


Cys Ala Thr Cys Asp Gly Phe Phe Tyr 


145 










150 




155 160 


Arg 


Asn 


Lys 


Asp 


Val 


He 


Val 


Val Gly Gly Gly Asn Thr Ala Val Glu 










165 






170 175 


Glu 


Ala 


Leu 


Phe 


Leu 


Thr 


Ser 


Phe Ala Ser Lys Val Thr Leu Val His 
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180 










185 190 


Arg 


Lys 


Asp 


Glu 


Leu 


Arg 


Ala 


Glu 


Lys He Leu Gin Glu Arg Leu Leu 






195 










200 


205 


Ala 


His 


Pro 


Lys 


He 


Glu 


Val 


He 


Trp Asp Ser Val He Asp Glu Val 




210 










215 




220 


Leu 


Gly 


Gin 


Thr 


Asp 


Pro 


Met 


Gly 


Val Thr Gly Ala Arg Leu Lys Asn 


225 










230 






235 240 


Val 


Lys 


Thr 


Gly 


Glu 


Thr 


Gin 


Glu 


Val Ala Ala Asp Gly Val Phe He 










245 








250 255 


Ala 


He 


Gly 


His 


Ala 


Pro 


Ser 


Ser 


Glu Leu Phe Ala Gly Gin Leu Glu 








260 










265 270 


Thr 


Gly 


Ser 


Gly 


Gly 


Tyr 


Leu 


Lys 


Val Lys Pro Gly Thr Ala Ser Thr 






275 










280 


285 


Ala 


He 


Glu 


Gly 


Val 


Tyr 


Ala 


Ala 


Gly Asp Val Thr Asp Asp Val Tyr 




290 










295 




300 


Arg 


Gin 


Ala 


Val 


Thr 


Ala 


Ala 


Gly 


Met Gly Cys Met Ala Ala Leu Glu 


305 










310 






315 320 


Ala 


Val 


Arg 


Phe 


Leu 


Ala 


Glu 


Glu 


Asp His Lys Ala Ala His His Pro 










325 








330 335 


He 


Ser 


His 


Ala 


Glu 


Ala 


Asn 


Lys 


He Gly Val Trp 








340 








345 



<210> 241 
<211> 285 
<212> PRT 

<213> Clostridium acetobutylicum 



<400> 241 



Met 


Glu 


Arg 


Tyr 


Asp 


He 


Ala 


He 


He Gly Ser Gly Pro Ala Gly Leu 


1 








5 








10 15 


Ala 


Ser 


Ala 


He 


Asn 


Ala 


Lys 


Thr 


Arg Asn Lys Ser Val He Val Phe 








20 








25 30 


Gly 


Ser 


Ser 


Asp 


Leu 


Ser 


Lys 


Lys 


Leu Thr Leu Ala Pro Val He Asn 






35 








40 


45 


Asn 


Tyr 


Leu 


Gly 


Phe 


Tyr 


Gly 


He 


Arg Gly Ala Glu Leu Gin Glu Lys 




50 










55 




60 


Phe 


Lys 


Glu 


His 


He 


Asp 


Asn 


Met 


Gly He Gin He Glu Asn Val Lys 


65 










70 






75 80 


Val 


Asn 


Asn 


He 


Tyr 


Ala 


Met 


Gly 


Glu Tyr Phe Ser He Met Thr Ser 










85 








90 95 


Lys 


Asp 


Thr 


Tyr 


Glu 


Ala 


Ser 


Lys 


Val He Leu Ala Met Gly Met Glu 








100 










105 110 


His 


Thr 


Lys 


Pro 


Leu 


Lys 


Gly 


Glu 


Asp Lys Phe Leu Gly Arg Gly Val 






115 










120 


125 


Gly 


Tyr 


Cys 


Ala 


Thr 


Cys 


Asp 


Ala 


Pro Leu Tyr Lys Gly Lys He Val 




130 










135 




140 


Thr 


He 


Val 


Gly 


Tyr 


Asn 


Lys 


Glu 


Ala Glu Ser Glu Ala Asn Tyr Leu 


145 










150 






155 160 


Ala 


Glu 


Leu 


Ala 


Ser 


Lys 


Val 


Tyr 


Tyr Val Pro Arg Tyr Lys Asp Glu 










165 








170 175 


Tyr 


Gin 


Leu 


Val 


Ser 


Ala 


Val 


Glu 


He Val Lys Asp Val Pro Val Glu 








180 










185 ~ 190 


He 


Val 


Gly 


Asp 


Lys 


Lys 


Val 


Glu 


Lys Leu Lys Leu Lys Ser Arg Glu 






195 










200 


205 


Leu 


Glu 


Thr 


Asp 


Gly 


Val 


Phe 


Val 


Leu Lys Asp Ser Ala Pro- Pro Glu 




210 










215 




220 


Gin 


Leu 


Val 


Pro 


Gly 


Leu 


Tyr 


Val 


Glu Asp Gly His He Lys Val Asn 


225 










230 






235 240 


Arg 


Lys 


Met 


Glu 


Thr 


Asn 


He 


Asp 


Gly Cys Tyr Ala Ala Gly Asp Cys 










245 








250 255 


Thr 


Gly 


Lys 


Pro 


Tyr 


Gin 


Tyr 


Met 


Lys Ala Val Gly Glu Gly Gin Val 








260 










265 270 


Ala 


Ala 


Leu 


Asn 


Ala 


Val 


Glu 


Lys 


Leu Tyr Thr Lys Ala 






275 










280 


285 
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<210> 242 
<211> 291 
<212> PRT 

<213> Clostridium acetobutylicum 
<400> 242 

Met Asp Arg Tyr Asp lie Ala lie lie Gly Ser Gly Pro Ala Gly Leu 

1 5 10 15 

Ser Ala Ala lie Asn Ala Val lie Arg Asn Lys Lys Val lie Leu Phe 

20 25 " 30 

Gly Ser Asp Asn Leu Ser Asn Lys Leu Leu Lys Ala Pro Lys He Asn 

35 40 45 

Asn Tyr Leu Gly He Tyr Asp Val Ser Gly Lys Glu Leu Lys Glu Lys 

50 55 60 

Phe Leu Glu His Leu Lys Tyr Met Asn He Glu He Lys Asn Glu Lys 
65 70 75 80 

Val Asn Ser Val Tyr Ser Met Gly Asp Tyr Phe Ala Leu Ser Leu Asn 

85 90 95 

Gin Lys Met Tyr Glu Ala Thr Ser He He He Ala Ser Gly Val Glu 

100 105 110 

Phe Ser Lys Pro Leu Asn Gly Glu Asp Glu Leu Leu Gly Lys Gly Val 

115 120 ~ 125 

Gly Tyr Cys Ala Thr Cys Asp Ala Pro Leu Tyr Lys Gly Lys Thr Val 

130 135 140 

Ala He Val Gly Tyr Thr Lys Glu Ala Glu Glu Glu Ala Asn Tyr Val 
145 ■ 150 155 160 

Ser Glu Leu Ala Gly Lys Leu Tyr Tyr He Pro Met Tyr Lys Asp Lys 

165 170 175 

Val Ser Leu Lys Glu Val He Glu Val Val Glu Asp Lys Pro He Ser 

180 185 190 

He Leu Gly Lys Asp Lys Val Ser Gly Leu Gin Met Ser Lys Gly Glu 

195 200 205 

He Asn Thr Asp Ala Val Phe He He Lys Asp Ser Val Ser Pro Gly 

210 215 220 

Lys Leu Val Pro Gly Leu Leu Met Asn Gly Glu His He Ala Val Asp 
225 230 235 240 

He Asp Met Lys Thr Asn He Glu Gly Cys Phe Ala Ala Gly Asp Cys 

245 250 255 

Ala Gly Arg Pro Tyr Gin Tyr He Lys Ser Ala Gly Gin Gly Gin He 

260 265 270 

Ala Ala Leu Ser Ala Val Ser Tyr He Asp Lys He Lys Leu Asn Lys 

275 280 285 

Lys He He 
290 



<210> 243 
<211> 314 
<212> PRT 

<213> Clostridium sticklandii 



<400> 243 



Met 


Ser 


Lys 


He 


Tyr 


Asp 


Leu 


Val He He Gly Ala Gly Pro Ala Gly 


1 








5 






10 15 


Leu 


Ser 


Ala 


Gly 


Leu 


Tyr 


Gly 


Ala Arg Gly Lys Met Ser Thr Leu He 








20 








25 * 30 < 


He 


Glu 


Lys 


Asp 


Lys 


Thr 


Gly 


Gly Gin He Val Thr Thr Glu Glu Val 






35 










40 45 


Ala 


Asn 


Tyr 


Pro 


Gly 


Ser 


He 


His Asp Ala Ser Gly Pro Ser Leu lie 




50 










55 


60 


Ala 


Arg 


Met 


Ala 


Glu 


Gin 


Ala 


Asp Glu Phe Gly Thr Glu Arg He Lys 


65 










70 




75 80 


Asp 


Ser 


He 


Val 


Asp 


Phe 


Asp 


Phe Thr Gly Lys He Lys He Leu Lys 










85 






90 95 


Gly 


Thr 


Lys 


Ala 


Glu 


Tyr 


Gin 


Ala Lys Ala Val He Val Ala Thr Gly 








100 








105 110 


Ala 


Ser 


Pro 


Lys 


Lys 


Leu 


Asp 


Cys Pro Gly Glu Lys Glu Leu Thr Gly 
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115 






120 


Lys 


Gly 


Val Ser Tyr 


Cys 


Ala 


Thr 




130 






135 




Met 


Glu 


Val Phe Val 


Val 


Gly 


Gly 


145 






150 






Met 


Tyr 


Leu Thr Lys 


Phe 


Ala 


Ser 




165 








Asp 


Ser 


Leu Arg Ala 


Ala 


Lys 


Ser 






180 








Pro 


Lys 


He Asp Phe 


Lys 


Trp 


Asp 






195 






200 


Asp 


Gly 


He Val Glu 


Ser 


Val 


Val 




210 






215 




Leu 


Ser 


Glu His Phe 


Ala 


Asp 


Glu 


225 






230 




Val 


Phe 


Thr Gly Tyr 


He 


Pro 


Gin 






245 








Asp 


Met 


Asn Gin Ser 


Gly 


Tyr 


Phe 






260 








Asn 


He 


Pro Gly Val 


Phe 


Ala 


Ala 






275 






280 


Arg 


Gin 


Val Val Thr 


Ala 


Thr 


Ala 


290 






295 




Ala 


Glu 


Lys Tyr He 


Glu 


His 


Glu 



305 310 



<210> 244 
<211> 325 
<212> PRT 

<213> Deinococcus radiodurans 



<400> 244 



Met 


Thr 


Ala 


Pro 


Thr 


Ala 


His 


Asp 


1 








5 








Gly 


Pro 


Ala 


Gly 


Leu 


Thr 


Ala 


Ala 








20 










Ser 


Thr 


Leu 


He 


Leu 


Glu 


Lys 


Gly 






35 










40 


Ser 


Glu 


Glu 


Val 


Glu 


Asn 


Phe 


Pro 




50 










55 




Met 


Glu 


Leu 


Ala 


Gin 


Arg 


Met 


His 


65 










70 






Lys 


Val 


Glu 


Met 


Asp 


Glu 


Val 


Gin 










85 








His 


Pro 


Tyr 


Pro 


Phe 


Thr 


Val 


Arg 








100 










Lys 


Ala 


Val 


He 


Leu 


Ala 


Thr 


Gly 






115 










120 


Pro 


Gly 


Glu 


Asp 


Asn 


Phe 


Trp 


Gly 




130 










135 




Cys 


Asp 


Gly 


Phe 


Phe 


Tyr 


Lys 


Gly 


145 










150 






Gly 


Asp 


Ala 


Ala 


Val 


Glu 


Glu 


Gly 










165 








Glu 


Val 


Thr 


Val 


He 


His 


Arg 


Arg 








180 










Ala 


Gin 


Ala 


Arg 


Ala 


Phe 


Ala 


Asn 






195 








200 


Thr 


Ala 


Val 


Glu 


Glu 


He 


Gin 


Gly 




210 










215 


Leu 


Arg 


Asn 


Leu 


Lys 


Thr 


Gly 


Glu 


225 










230 






Val 


Phe 


He 


Phe 


He 


Gly 


His 


Val 










245 








Thr 


Val 


Ser 


Leu 


Arg 


Asp 


Asp 


Gly 



260 



125 



Cys 


Asp 


Ala 


Asp 


Phe 


Phe 


Gin 


Asp 








140 










Gly 


Asp 


Ser 


Ala 


Val 


Glu 


Glu 


Ala 




155 










160 


Lys 


Val 


Thr 


He 


Val 


His 


Arg 


Arg 




170 










175 




He 


Gin 


Asp 


Lys 


Ala 


Phe 


Ala 


Asn 


185 










190 






Ser 


Val 


He 


Lys 


Glu 


He 


Lys 


Gly 










205 








Phe 


Glu 


Asn 


Thr 


Lys 


Thr 


Gly 


Glu 








220 










Glu 


Phe 


Gly 


Thr 


Phe 


Gly 


He 


Phe 






235 










240 


Thr 


Asp 


He 


Phe 


Lys 


Asp 


Lys 


Val 




250 










255 




Val 


Thr 


Asn 


Gin 


Asn 


Met 


Glu 


Thr 


265 










270 






Gly 


Asp 


Cys 


Arg 


Glu 


Lys 


Val 


Leu 










285 








Asp 


Gly 


Ala 


He 


Ala 


Ala 


He 


Met 






300 










Gly 


Leu 














Tyr 


Asp 


Val 


Val 


He 


He 


Gly 


Gly 




10 










15 




He 


Tyr 


Thr 


Gly 


Arg 


Ala 


Gin 


Leu 


25 










30 






Met 


Pro 


Gly 


Gly 


Gin 


He 


Ala 


Trp 










45 








Gly 


Phe 


Pro 


Glu 


Pro 


He 


Ala 


Gly 






60 










Gin 


Gin 


Ala 


Glu 


Lys 


Phe 


Gly 


Ala 






75 










80 


Gly 


Val 


Gin 


His 


Asp 


Ala 


Thr 


Ser 




90 










95 




Gly 


Tyr 


Asn 


Gly 


Glu 


Tyr 


Arg 


Ala 


105 










110 






Ala 


Asp 


Pro 


Arg 


Lys 


Leu 


Gly 


He 










125 








Lys 


Gly 


Val 


Ser 


Thr 


Cys 


Ala 


Thr 








140 










Lys 


Lys 


Val 


Val 


Val 


He 


Gly 


Gly 






155 










160 


Met 


Phe 


Leu 


Thr 


Lys 


Phe 


Ala 


Asp 




170 










175 




Asp 


Thr 


Leu 


Arg 


Ala 


Asn^ 


Lys 


Val 


185 










190 






Pro 


Lys 


Met 


Lys 


Phe 


He 


Trp 


Asp 










205 








Ala 


Asp 


Ser 


Val 


Ser 


Gly 


Val 


Lys 








220 










Val 


Ser 


Glu 


Leu 


Ala 


Thr 


Asp 


Gly 






235 










240 


Pro 


Asn 


Thr 


Ala 


Phe 


Val 


Lys 


Asp 




250 










255 




Tyr 


Val 


Asp 


Val 


Arg 


Asp 


Glu 


He 


265 










270 
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Tyr Thr Asn lie Pro Met Leu Phe Ala Ala Gly Asp Val Ser Asp Tyr 

275 280 285 

lie Tyr Arg Gin Leu Ala Thr Ser Val Gly Ala Gly Thr Arg Ala Ala 

290 295 300 

Met Met Thr Glu Arg Gin Leu Ala Ala Leu Glu Val Glu Gly Glu Glu 
305 310 315 320 

Val Thr Ala Ala Asp 
325 



<210> 245 
<211> 61 
<212> PRT 

<213> Enterococcus faecalis 
<220> 

<221> VARIANT 

<222> 33, 45, 46 

<223> Xaa = Any Amino Acid 

<400> 245 

Met Met Asp Thr Leu lie lie Glu Lys Asp Lys lie Gly Gly Gin Val 

1 5 10 15 

Thr Thr Thr Ser Glu lie Val Asn Tyr Pro Ala lie Arg His Thr Thr 

20 25 30 

Xaa Pro Glu Leu Met Gly Glu Met Arg He Gin Ala Xaa Xaa Phe Gly 

• 35 40 45 

Val Ala Phe Thr Lys Asp Glu He He Asp Val Asp Phe 
50 * 55 60 



• <210> 246 
<211> 205 
<212> PRT 

<213> Halobacterium sp 
<400> 246 

Met Thr Glu Asp Ser His Asp Leu Val He Ala Gly Ser Gly He Ala 

15 10 15 

Gly Leu Ser Ala Ala Val Tyr Ala Ala Arg Ala Asp Leu Glu Pro Leu 

20 25 30 

Val Leu Glu Gly Asp Glu Pro Gly Gly Gin Leu Thr Leu Thr Thr Asp 

35 40 45 

Val Glu Asn Tyr Leu Gly Phe Pro Asp Gly Val Gly Gly Met Asp Leu 

50 55 60 

Val Gin Arg Gly Lys Glu Gin Ala Glu Gin Phe Gly Ala Gin Phe Glu 
65 70 75 80 

His Gly Arg He Glu Ala Ala Asp Leu Asp Gly Gin Pro Leu Glu Leu 

85 90 95 

Ser Leu Ser Thr Gly Asp Thr Leu Tyr Thr Arg Ser Leu He Val Ala 

100 105 110 

Thr Gly Ala Ser Ala Arg Trp Val Gly Ala Glu Asn Glu Asp Glu Leu 

115 120 125 

Met Gly Ala Gly Leu Ser Thr Cys Ala Thr Cys Asp Gly Ala Phe His 

130 135 140 

Arg Gly Asp Asp Val Leu Val Val Gly Gly Gly Asp Ser Ala Met Glu 
145 150 155 160 

Glu Ala Leu Phe Leu Ala Lys Phe Ala Asp Ser Val Thr Val Val His 

165 170 175 

Arg Arg Glu Glu Leu Arg Ala Ser Glu He Met Ala Asp Arg Ala Arg 

180 185 190 

Asp His Asp Asp Val Gin Phe Arg Trp Asn Thr Glu Leu 
195 200 205 



<210> 247 
<211> 362 
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<212> PRT 

<213> Halobacterium sp 
<400> 247 

Met Thr Glu Ala Thr Ala Asp Arg Thr Ala Leu Thr Asp Gly Gly- Arg 

15 10 15 

Asp Val Val Glu His Arg Gin Leu Val lie Val Gly Ser Gly He Ala 

20 25 30 

Ala Leu Ser Ala Ala Thr Tyr Ala Ala Arg Ser Asn Asn Asp Pro Leu 

35 40 45 

Leu Phe Glu Gly Asp Glu Pro Gly Gly Gin Leu Thr Leu Thr Ser Glu 

50 55 60 

Val Glu Asn Tyr Pro Gly Phe Pro Glu Gly He Ala Gly Ala Glu Leu 
65 70 75 80 

He Gin Glu Met Lys Thr Gin Ala Thr Arg Phe Gly Ala Glu Val Glu 

85 90 95 

His Gly He Val Glu Ser Val Asp Asp Ser Gly Arg Pro Phe Arg Leu 

100 105 110 

Thr Leu Thr Asn Gly Asp Val Tyr Thr Ala Asp Ala Val He Val Ala 

115 120 125 

Ser Gly Ala Ser Ala Arg Thr Leu Gly He Pro Gly Glu Asp Glu Leu 

130 135 140 

Met Gly Gin Gly Val Ser Thr Cys Ala Thr Cys Asp Gly Ala Phe Phe 
145 150 155 - 160 

Arg Gly Glu Asp Met He Val Val Gly Gly Gly Asp Ala Ala Ala Glu 

165 170 175 ' 

Glu Ala Ser Phe Leu Thr Lys Phe Ala Asp Thr Val Tyr Leu Val His 

180 185 190 

Arg Arg Asp Glu Leu Arg Ala Glu Asp Tyr Trp Ala Asp Arg He Arg 

195 200 " 205 

Glu His Val Ala Asp Gly Asp He Glu Val Leu Trp Asn Thr Glu Ala 

210 215 220 

Val Glu Val His Gly Ser Pro Glu Glu Gly Val Thr Gly Ala Ser Leu 
225 230 235 240 

Val Arg His Pro Glu Gly His Pro Thr Ala Lys Leu Asp Ala Asp Glu 

245 250 255 

Thr Glu Gin Leu Glu Leu Asp He Gly Ala Phe Phe He Ala He Gly 

260 265 270 

His Thr Pro Asn Thr Ser Phe Leu Ala Asp Thr Gly Val Val Cys Asp 

275 280 ' 285 

Asp Ala Gly Tyr Val Gin Thr Val Gly Gly Ala Gly Gly Gly Gin Thr 

290 295 300 

Lys Thr Asp Val Thr Gly Val Phe Gly Ala Gly Asp Val Val Asp Tyr 
305 * 310 315 320 

His Tyr Gin Gin Ala Val Thr Ala Ala Gly Met Gly Ser Lys Ala Ala 

325 330 335 

He Asp Ala Asp Glu Tyr Leu Glu Ser Val Ala Asp Gly Val Thr Gly 

340 345 350 

Glu Thr Ala Asp Ala Thr Pro Ala Asp Asp 
355 360 



<210> 248 
<211> 294 
<212> PRT 

<213> Halobacterium 
<400> 248 

Met Pro Thr Gin Asp Gly Glu Arg 

1 5 
Gly Pro Ala Gly Cys Ala Ala Gly 
20 

Asp Thr Val Val Phe Asp Arg Gly 

35 40 
Phe Val Glu Asn Tyr Pro Gly Phe 

50 55 
Leu Arg Gly Leu Phe His Asp His 



Arg Asp Val Val He Val Gly Gly 

10 15 
Val Phe Thr Ala Arg Tyr Gly Leu 
25 30 
Asn Ala Ala Leu Pro Arg Cys Ala 
45 

Pro Gly Gly He Asp Val Pro Thr 
60 

Ala Glu Thr Ala Gly Cys Asp Leu 
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65 

He Ala 

Gly Phe 

Val Leu 

Gly Asp 
130 
Arg Phe 
145 

Gly Leu 

He Ala 

Arg Lys 

Trp Lys 
210 
Arg Trp 
225 

Asp Asp 

Phe Asp 

Gly Ala 

Tyr Arg 
290 



Asp Thr 

Val Val 
100 
Ala Ala 
115 

Ser Ala 

Asp Asp 

Tyr Val 

Ala Gly 
180 
Arg Ala 
195 

Arg Arg 

Arg Glu 

Asp Glu 

Ala Thr 
260 
His Arg 
275 

Glu Gin 



70 

Val Glu Ser 
85 

Glu Thr Gin 

Ala Trp Tyr 

Phe Glu Thr 
135 

Ala Tyr Ala 

150 
Ala Ser Pro 
165 

Asn Gly Ala 

Arg Gly Tyr 

Glu Ser Asp 
215 

Trp Phe Ala 

230 
Phe Ala Ala 
245 

Leu Ser Ala 
Leu Leu Asp 
Arg Asp 



Val Asp 

Asp Gly 
105 
Asp Gly 
120 

His Asp 

Asp Ala 

Gly Gly 

His Val 
185 
Pro Glu 
200 

Leu Ser 

Ala Glu 

Leu Arg 

Asp Ala 
265 
His He 
280 



75 
Arg Pro 
90 

Arg Arg 

Ser Tyr 

His His 

Asp Gly 
155 
Gin Arg 
170 

Ala Arg 

Gly Val 

Gly Glu 

Ala Gly 
235 
Ala Ala 
250 

He Glu 
Asp Asp 



Ser Asp 

Val Tyr 

Leu Arg 
125 
Gly Glu 
140 

Arg Thr 

Ser Ala 

Cys Leu 

Ala Pro 
205 
Trp Ala 
220 

Asp Asp 

His Leu 

Glu Arg 

Asp His 
285 



80 

Asp Asp Thr 
95 

Thr Asp Thr 
110 

Pro Val Val 

Ser Arg Glu 

Pro Val Asp 
160 

Gin Ala Val 

175 
Leu Ala Asp 
190 

His Tyr Asp 

Asp Arg Asp 

His Asp Leu 
240 

Asp Arg Thr 

255 
Ala Glu- Ala 
270 

He Glu Ser 



<210> 249 
<211> 324 
<212> PRT 

<213> Helicobacter pylori 



<400> 249 



Met 


Asn 


Gin 


Glu 


He 


Leu 


Asp Val 


Leu He Val Gly Ala Gly Pro Gly 


1 








5 






10 15 


Gly 


He 


Ala 


Thr 


Ala 


Val 


Glu Cys 


Glu He Ala Gly Val Lys Lys Val 








20 








25 30 


Leu 


Leu 


Cys 


Glu 


Lys 


Thr 


Glu Ser 


His Ser Gly Met Leu Glu Lys Phe 






35 








40 


45 


Tyr 


Lys 


Ala 


Gly 


Lys 


Arg 


He Asp 


Lys Asp Tyr Lys Lys Gin Val Val 




50 










55 


60 


Glu 


Leu 


Lys 


Gly 


His 


He 


Pro Phe 


Lys Asp Ser Phe Lys Glu Glu Thr 


65 










70 




75 80 


Leu 


Glu 


Asn 


Phe 


Thr 


Asn 


Leu Leu 


Lys Glu His His He Thr Pro Ser 










85 






90 95 


Tyr 


Lys 


Thr 


Asp 


He 


Glu 


Ser Val 


Lys Lys Glu Gly Glu Tyr Phe Lys 








100 








105 110 


He 


Thr 


Thr 


Thr 


Ser 


Asn 


Thr Thr 


Tyr His Ala Lys Phe Val Val Val 






115 








120 


125 


Ala 


He 


Gly 


Lys 


Met 


Gly 


Gin Pro 


Asn Arg Pro Thr Ala Tyr Lys He 




130 










135 


140 


Pro 


Val 


Ala 


Leu 


Ser 


Lys 


Gin Val 


Val Phe Ser He Asn Asp* Cys Lys 


145 










150 




155 - 160 


Glu 


Asn 


Glu 


Lys 


Thr 


Leu 


Val He 


Gly Gly Gly Asn Ser Ala Val Glu 










165 






170 175 


Tyr 


Ala 


He 


Ala 


Leu 


Cys 


Lys Thr 


Thr Pro Thr Thr Leu Asn Tyr Arg 








180 








185 190 


Lys 


Lys 


Glu 


Phe 


Ser 


Arg 


He Asn 


Glu Asp Asn Ala Lys Asn Leu Gin 




195 








200 


205 


Glu 


Val 


Leu 


Asn 


Asn 


Asn 


Thr Leu 


Lys Ser Lys Leu Gly Val Asp He 




210 










215 


220 


Glu 


Ser 


Leu 


Glu 


Glu 


Asp 


Asn Thr 


Gin He Lys Val Asn Phe Thr Asp 


225 










230 




235 240 
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Asn Thr 


Ser 


Glu 


Ser 








245 


Thr Pro 


Leu 


Glu 


Phe 






260 




Thr Asn 


He 


Pro 


Val 




275 






Asn Leu 


Phe 


He 


Val 


290 








Ala Thr 


Ala 


Leu 


Asn 


305 








Arg Leu 


His 


Ser 





Phe Asp Arg Leu Leu Tyr 
250 

Phe Lys Arg Cys Ser Leu 
265 

Val Lys Glu Asn Leu Glu 
280 

Gly Asp He Leu Phe Lys 
295 

His Gly Tyr Asp Val Ala 
310 315 



Ala He Gly Gly Ser 
255 

Glu Leu Asp Pro Ser 
270 

Ser Asn Asn He Pro 
285 

Ser Gly Ala Ser He 
300 

He Glu He Ala Lys 
320 



<210> 250 

<211> 128 

<212> PRT 

<213> Klebsiella oxytoca 



<400> 250 



Met 
1 


Gly 


Thr 


Ala 


Lys 
5 


His 


Ser Lys 


Ala 


Gly 


Tyr 


Thr 


Ala 


Ala 


Val Tyr 








20 








Val 


Leu 


He 


Thr 


Gly 


Met 


Glu Lys 






35 






40 


Glu 


Val 


Glu 


Asn 


Trp 


Pro 


Gly Asp 




50 










55 


Leu 


Met 


Glu 


Arg 


Met 


His 


Glu His 


65 








70 




He 


Phe 


Asp 


His 


He 


Asn 


Ser Val 










85 






Leu 


Val 


Gly 


Asp 


Ser 


Gly 


Glu Tyr 








100 








Tyr 


Arg 


Arg 


He 


Ser 


Ala 


Leu Ser 






115 








120 



Leu Leu He Leu Gly Ser Gly Pro 

10 15 
Ala Ala Arg Ala Asn Leu Gin Pro 
25 30 
Gly Gly Gin Leu Thr Thr Thr Thr 
45 

Pro Asn Asp Leu Thr Gly Pro Leu 
60 

Ala Thr Lys Phe Glu Thr Glu He 

75 80 
Asp Leu Gin Asn Arg Pro Phe Arg 

90 95 
Thr Cys Asp Ala Pro Asp Tyr Arg 
105 110 
Gly Ser Ala He Gly Arg Arg Val 
125 



<210> 251 
<211> 79 
<212> PRT 

<213> Lactococcus lactis 
<400> 251 

Met Gin Glu Leu Asp Leu He He Val Gly Ala Gly Pro Val Gly Leu 

15 10 15 

Tyr Ala Ala Phe Tyr Ala Gly Met Arg Gly Leu Ser Val Ala He He 

20 25 30 

Glu Ser Ala Gin Val Pro Gly Gly Gin Pro Gin Asn Leu Tyr Pro Glu 

35 40 45 

Lys Leu He Tyr Asp He Ala Gly Leu Pro Ala Val Thr Gly Ala Asp 

50 55 60 

Leu Thr Lys Asn Leu Leu Glu Gin Leu Ala Gin He Ser His Arg 
65 70 75 



<210> 252 
<211> 321 
<212> PRT 

<213> Lactococcus lactis 
<400> 252 

Met Gin Glu Leu Asp Leu He He Val Gly Ala Gly Pro Val Gly Leu 

15 10 15 

Tyr Ala Ala Phe Tyr Ala Gly Met Arg Gly Leu Ser Val Ala He He 

20 25 30 

Glu Ser Ala Gin Val Pro Gly Gly Gin Pro Gin Asn Leu Tyr Pro Glu 
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35 




40 




Lys 


Leu 


Tie Tvr Asn 


lie 


Ala Gly 


Leu 




50 






55 




Leu 


Thx 


Lvs Asn Leu 


Leu 


Glu Gin 


Leu 


65 




70 






Phe 


Leu 


Gly Glu Sex 


Val 


Gin Lys 


lie 






85 








Sex 


Val 


Thx Thx Asn 


Lys 


Sex Thx 


Arcr 






100 






105 


Thx 


Thx 


Glv Ala Glv 


Leu 


TiRu Lvs 


Pxo 






115 




120 




Glu 


Gin 

VJ-L LL 


X IIX UC U ./TlXO. 


xi&ll 


Glu Glv 


Lys 




130 






135 




Leu. 


Lys 


Glu PHf=» Glu 


Glv 
v*xy 


TiVR Asn 


Val 


145 






mo 






Sex 


Ala 


Lpn Asti Tim 


Sex 


Leu Met 


Leu 






165 








His 


Leu 


Val His Axa 




Thx Ala 


Phe 






180 






185 


Asp 


«xy 


Val Met Asn 


Sex 


Asn Val 

luJii V OX. 


Gin 






195 




200 




Sex 


Asn 


Leu Tie Glu 


Asn 


Glu Leu 

wXLt XICLl 


Glu 




210 






215 




(Tin 


Odi 


J-icu n&ii xriie 


OCX. 


Tl A R CT"i 


T,vo 

uyo 














XJC LI 




Asn Gin Val 

noil uixi Vdx 




Ti^u 2Vl a 
xjc u /via 


Glu 

V31U 






245 








Gl v 
vjxy 


Axg 


Val T.vfl Ala 


Asp 




1*1 C L. 












£t O J 


Tyx 


Val 


Ala Gly Asp 


Ala 


Sex Asp 


Tyx 






275 




280 




Sex 


Val 


Gly Phe Gly 


Glu 


Ala Val 


His 




29,0 






295 




Lys 


Leu 


Glu Phe Asp 


His 


Pxo Leu 


Axg 


305 






310 






Phe 













<210> 253 

<211> 308 

<212> PRT 

<213> Lactococcus lactis 



<400> 253 



Met 
1 


Thx 


Glu 


Lys 


Lys 
5 


Tyx 


Asp Val 


Val 


Gly 


Met 


Thx 


Ala 


Ala 


Met 


Tyx Thx 


Ala 






20 






25 


Leu 


Leu 


Glu 


Axg 


Gly 


Val 


Pxo Gly 


Gly 






35 








40 




He 


Glu 


Asn 


Tyx 


Pxo 


Gly 


Tyx Glu 


Thx 




50 










55 




Met 


Lys 


Met 


Ala 


Glu 


Pxo 


Leu Glu 


Gly 


65 










70 




Gly 


Phe 


Val 


Thx 


Ala 


He 


Glu Asp 


His 








85 






Thx 


Glu 


Asp 


Asp 


Glu 


Phe 


Val Thx 


Lys 








100 








105 


Ala 


Asn 


His 


Axg 


Lys 


Leu 


Glu He 


Pxo 






115 








120 




Axg 


Gly 


Val 


Sex 


Tyx 


Cys 


Ala Val 


Cys 




130 










135 




Gin 


Glu 


He 


Leu 


Val 


He 


Gly Gly 


Gly 


145 










150 






Leu 


Tyx 


Leu 


Thx 


Axg 


Phe 


Gly Gin 


Sex 



165 



PCTYUS01/50240 







45 








Pxo Ala 


Val 


Thx 


Glv 

J; 


Ala Asp 




60 










Ala Gin 


He 


Sex 


His 


Axg Leu 


75 










80 


Glu Lys 


Glu 


Glu 


Glv 


He 


Phe 


90 








95 




Axg Ala 


Lys 


Glv 


Val 


Leu 


Leu 








110 






Axg Lys 


Leu 


Glv 


He 


Asp Asn 






125 








11C OCX 


TVr 
xyx 


Phe 


He 


Thx 


Sex 




140 










Ala Val 


Phe 


Glv 

\jxy 


Glv 

vjxy 


Gly Asp 


155 










160 


Glu Lvs 

U1U XJy 0 


Val 


Ala 


Lys 


Asn 


Val 


170 








175 




Axg Gly 


His 


Glu 


He 


Thx 


Val 








190 






Val His 


Thx 


Pxo 


Tvx 

X J X 


Thx 


Phe 






205 






Leu Lvs 


Lys 


He 


Lys 


Sex 


Glu 




220 










Tie Tien 

lie XI C Li. 


Val 


Asn 


xyx 


Gly Phe 


235 












Lcjn T»eu 


Glu 


Val 


Sex 


Axg 


Asn 


250 








o c c 




Gin Sex 


Asn 


He 


Glu 


Caxy 


T an 








270 




Pxo Gly 


Lys 


Met 


Pxo 


Leu 


jYieu 






285 








Ala He 


Asn 


Ala 


Met 


lnx 


Lys 




J \J \J 










Gly Gly 


His 


Sex 


Sex 


Sex 


Tl *a 

lie 


315 












He He 


Glv 
vjxy 


Sex 


Glv 

v»xy 


Pxo 


Ala 


10 








15 




Axg Sex 


Glu 


Met 


Lys 


Thx 


Leu 








30 






Gin Met 


Asn 


Asn 


Thx 


Ala 


Glu 






45 








Tie Met 

X X C 11C I— 


Glv 


Pxo 


Glu 


Leu 


Sex 




60 










T.en Glv 
XI C ui vjx y 


Val 


Glu 


Asn 


Ala 


Tyx 


75 










80 


Glv Asn 


Tvr 

xyx 


Lys 


Lys* 


He 


He 


90 








95 




Sex He 


He 


He 


Ala 


Thx 


Gly 








110 




Gly Glu 


Glu 


Glu 


Tvx 

xyx 


Gly 


Ala 






125 








Asp Gly 


Ala 


Phe 


Phe 


Axg 


Asn 




140 










Asp Sex 


Ala 


Val 


Glu 


Glu 


Ala 


155 










160 


Val Thx 


He 


Met 


His 


Axg 


Axg 


170 








175 
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Asp 


Lys 


Leu 


Arg 


Ala 


Gin 


Glu 


He 






180 










Glu 


Lys 


He 
195 


As n 


Phe 


He 


Trp 


Asp 
200 


Asp 


Asp 
210 


Lys 


Lys 


Val 


Gin 


Ser 
215 


Val 


Glu 


Val 


Thr 


Glu 


Lys 


Ala 


Phe 


Gly 


225 










230 




Asp 


Pro 


Val 


Ala 


Glu 


Phe 


Ala 


Gly 










245 






Gly 


Trp 


He 


He 
260 


Thr 


Asp 


Asp 


His 


Phe 


Ala 


Val 
275 


Gly 


Asp 


Val 


Arg 


Gin 
280 


Ala 


He 
290 


Gly 


Asp 


Gly 


Ala 


Gin 
295 


Ala 


Ala 


Glu 


Leu 


Asp 










305 















He Gin Gin Arg Ala Phe Lys Glu 
185 ^ 190 

Ser Val Pro Met Glu He Lys Gly 
205 

Val Tyr Lys Asn Val Lys Thr Gly 
220 

Gly He Phe He Tyr Val Gly Leu 
235 * 240 

Asn Leu Gly He Thr Asp Glu Ala 

250 ' 255 
Met Arg Thr Ser Leu Pro Gly He 
265 ~ 270 
Lys Asp Phe Arg Gin He Thr Thr 
285 

Ala Gin Glu Ala Tyr Lys Phe Val 
300 



<210> 254 
<211> 44 
<212> PRT 

<213> Lactococcus lactis 
<400> 254 

Met Gin Glu Leu Asp Leu lie lie Val Gly Ala Gly Pro Val Gly Leu 

1 5 10 ' 15 

Tyr Ala Ala Phe Tyr Ala Gly Met Arg Gly Leu Ser Val Ala He He 

20 25 30 

Glu Ser Ala Gin Val Pro Gly Gly Gin Pro Gin Asn 
35 40 



<210> 255 
<211> 339 
<212> PRT 

<213> Listeria monocytogenes 



<400> 255 



Glu 


Phe 


Tyr 


Ser 


Tyr 


Lys 


Lys 


Glu He Asn Arg Tyr Leu Ala Glu Glu 


1 








5 






10 15 


Asp 


Ser 


Ala 


Ser 


Ala 


Cys 


Asp 


He Leu Arg Lys Val He Asp Glu Lys 








20 








25 30 


Pro 


Asn 


Phe 


Trp 


Pro 


Ala 


Tyr 


Asn Gin Leu Ala Ser Leu Tyr Phe Glu 






35 








40 45 


Gin 


Leu 


Lys 


Glu 


Glu 


Glu 


Gly 


Val Arg Val Leu Ser Asp Leu Leu Ser 




50 










55 


60 


Arg 


Asn 


Pro 


Gly 


Asn 


Leu 


Leu 


Gly He Cys Asp Leu Phe He Tyr His 


65 










70 




75 80 


Phe 


Tyr 


Lys 


Gly 


Asn 


Arg 


Lys 


Glu Ala Asp Glu Leu Tyr Leu Glu Leu 










85 






90 95 


Arg 


Asp 


Val 


Leu 


Pro 


Val 


Leu 


Ala His His Lys Glu Lys Leu Gly Leu 








100 








105 110 


He 


His 


Ala 


Met 


Met 


Gly 


Glu 


Tyr Glu Glu Ala Asp Asp Leu Leu Glu 






115 










120 " 125 


Gin 


Val 


Ala 


Asp 


Leu 


Glu 


Val 


Thr Glu Arg Ser Lys Tyr Tyr Tyr Phe 




130 










135 


140 


Arg 


Ala 


Lys 


Ser 


Ser 


Tyr 


Tyr 


Leu Gly Asp Val Glu Gly Ala Lys Met 


145 










150 




155 ~ 160 


Phe 


Trp 


His 


Ser 


Phe 


Leu 


Glu 


Cys Asp Leu Tyr Glu Asp Val Arg Phe 










165 






170 175 


Pro 


Trp 


Glu 


Gin 


Glu 


Pro 


Asp 


Leu Thr Asn Asp Thr Arg Leu Val Leu 








180 








185 190 


Glu 


Met 


Leu 


Gin 


Glu 


Glu 


Asp 


Asp Leu Thr His Met Leu Gly Val Tyr 






195 










200 205 


Ala 


Leu 


Thr 


He 


Ser 


Gly 


Asn 


Arg Pro Glu Leu Val Leu Phe His Pro 
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210 










215 




Leu 


uc u. 


2\ en 


Met 




Sen 




Ser 


225 










230 






Asn 






TVr 
±yx 


Phe 
245 


Pro 


Asp 


Glv 


He 


Ala 


Lys 


Ala 


Met 


He 


He 


Leu 






260 










Glu 


Glu 


275 


Met 


Ala 


Leu 




Lys 
280 


Asp 


Ala 
290 


Gly 


Lys 


Asp 


Leu 


He 
295 


Leu 


Ala 


Ser 


Ala 


He 


Ala 


Lys 


Leu 


Phe 


305 










310 






Val 


Glu 


Glu 


Phe 


Glu 
325 


Cys 


Ser 


Lys 


Leu 


Ser 


Arg 













220 



Tyr 


Met 


Glu 


His Leu 


Met 


Phe 


Thr 






235 








240 


Ala 


He 


Glu 


Gin Asn Gly Tyr Leu 




250 








255 




Lys 


Glu 


Asn 


Gly He 


Leu 


Leu 


Asn 


265 








270 






Gin 


Met 


Phe 


Ser Leu 


Val 


Leu 


He 








285 








Gly 


Arg 


Tyr 


Thr He 


Glu 


Thr 


Val 








300 








Leu 


Pro 


His 


Leu Lys 


Leu 


Gin 


Leu 






315 






320 


Cys 


Ala 


Arg 


Asp He Glu Arg Val 




330 








335 





<210> 256 
<211> 303 
<212> PRT 

<213> Methanothermobacter thermautotrophicus 



<400> 256 



Mpf- 


rlCt 


XI LI. f\QLJ xyi -MUD^J 


L. 


X JLC 


V d X 


11C 


vji y 


2X1 a Gl T.r "Dyv*"* 11 a f2l 
xvLo VjXjf iri \J AJ. cl uiy 


X 




C 

. o 














ucu 


nil 


Al a Glv Tie* Tvr 


Glv 






Gin 


Glv 


c<=*-r <3f»T Val T.*=»n M^t- 

OC1 OCl. VAX JJCU rlCL 






on 






ZD 






Oft 
JV 


iJC u 




T ,vc Gl Prn 71 1 3 
XJ jr iD v?i _y nu rtl d 




vrx_y 




Gly 


ucu 


Glu Val Prn Mf»t* M<=t* 










*± u 








il ^ 


Glu 


Asn 


Tyr Pro Gly Phe 


Glu 


Met 


He 


Ala 


Gly 


Met Ser Leu Val Thr 




50 




55 










60 


Lys 


Met 


Lys Lys Gin Ala 


Thr 


Ala 


Val 


Ala 


Glu 


Leu Arg Glu Met Glu 


65 




.70 










75 


80 


Glu 


Val 


Lys Glu He Glu 


Lys 


Gly 


Asp 


Val 


Phe 


Thr Val Lys Thr Ser 






85 








90 




95 


Arg 


Asp 


Thr Tyr Thr Ala 


Ser 


Ala 


He 


He 


Phe 


Ala Thr Gly Ser Lys 






100 






105 






110 


His 


Arg 


Gin Leu Gly Val 


Pro 


Gly 


Glu 


Asn 


Asp 


Leu Leu Gly Arg Gly 






115 




120 








125 


Val 


Cys 


Tyr Cys Ala Thr 


Cys 


Asp 


Gly 


Pro 


Leu 


Tyr Lys Gly Arg Lys 




130 




135 










140 


Val 


Leu 


Met Val Gly Gly 


Gly 


Asn 


Ser 


Ala 


Ala 


Gin Glu Ala Val Phe 


145 




150 










155 


160 


Leu 


Lys 


Asn He Gly Cys 


Asp 


Val 


Ser 


He 


Val 


His Arg Arg Asp Glu 






165 








170 




175 


Leu 


Arg 


Ala Asp Lys Tyr 


Leu 


Gin 


Asp 


Lys 


Leu 


Arg Glu Met Glu He 






180 






185 






190 


Pro 


Val 


He Trp Asn Ser 


Val 


Val 


Lys 


Glu 


He 


Gly Gly Asp Glu Arg 






195 




200 








205 


Val 


Glu 


Glu Val He He 


His 


Asn 


Arg 


Val 


Thr 


Gly Arg Asp Glu Thr 




210 




215 










220 


Leu 


Lys 


Val Asp Gly Val 


Phe 


lie 


Ala 


He 


Gly 


Glu Glu Pro Leu Asn 


225 




230 










235 


240 


Gin 


Leu 


Ala Val Asp Leu 


Gly 


Val 


Glu 


Val 


Asp 


Lys Gly Gly Tyr He 






245 








250 




255 


He 


Thr 


Asp Lys Phe Gin 


Arg 


Thr 


Asn 


Val 


Pro 


Leu Val Tyr Ala Ala 






260 






265 






270 


Gly 


Asp 


He Thr Gly Gly 


Leu 


Asn 


Gin 


Trp 


Val 


Thr Ala Cys Ala Glu 






275 




280 








285 


Gly 


Ala 


He Ala Ala Thr 


Tyr 


Ala 


Tyr 


Arg 


Glu 


He Gin Ser Tyr 




290 




295 










300 



<210> 257 
<211> 179 
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<212> PRT 

<213> Bacillus subtilis 
<400> 257 

Met Val lie Ser Gly Gly Gly Asp Thr Ala Val Asp Trp Ala Asn Glu 

15 10 15 

Leu Glu Pro He Ala Ala Ser Val Thr Val Val His Arg Arg Glu Glu 

20 25 30 

Phe Gly Gly Met Glu Ser Ser Val Thr Lys Met Lys Gin Ser Ser Val 

35 40 45 

Arg Val Leu Thr Pro Tyr Arg Leu Glu Gin Leu Asn Gly Asp Glu Glu 

50 55 60 

Gly He Lys Ser Val Thr Val Cys His Thr Glu Ser Gly Gin Arg Lys 
65 70 75 80 

Asp He Glu He Asp Glu Leu He He Asn His Gly Phe Lys He Asp 

85 90 95 

Leu Gly Pro Met Met Glu Trp Gly Leu Glu He Glu Glu Gly Arg Val 

100 105 110 

Lys Ala Asp Arg His Met Arg Thr Asn Leu Pro Gly Val Phe Val Ala 

115 120 " 125 

Gly Asp Ala Ala Phe Tyr Glu Ser Lys Leu Arg Leu He Ala Gly Gly 

130 " 135 * 140 

Phe Thr Glu Gly Pro Thr Ala Val Asn Ser Ala Lys Ala Tyr Leu Asp 
145 150 155 160 

Pro Lys Ala Glu Asn Met Ala Met Tyr Ser Thr His His Lys Lys Leu 
165 170 175 

Val His Lys 



<210> 258 
<211> 307 
<212> PRT 

<213> Mycoplasma pulmonis 
<400> 258 

Met Ser Gin Asn Lys He Tyr Asp Val Ala He He Gly Ala Gly Pro 

15 10 15 

Gly Ala Leu Thr Ala Ala He Tyr Thr Ser Arg Gly Asn Leu Asp Thr 

20 25 30 

Val Phe He Asp Asn Ala Ala Pro Gly Gly Lys Leu He Tyr Ala Ser 

35 40 45 

Lys He Glu Asn Trp Pro Gly Asp Thr He Val Lys Gly Thr Asp Leu 

50 55 60 

Ala He Arg Phe Phe Glu His Ala Gin Ala Phe Gly Ala Lys Tyr Glu 
65 70 75 80 

Tyr Gly Lys Val Val Asp Leu He Aan He Lys Asp Asp Leu Lys Glu 

85 90 95 

Leu Val Leu Glu Asp Gly Lys Lys He Gin Ala Lys Ser Val He He 

100 105 " 110 

Ala Ser Gly Met Val Ser Arg Lys Pro Arg Glu He Leu Asn Tyr Asp 

115 120 ~ 125 

Glu Phe Glu Asn Arg Gly Val Ser Tyr Cys Val He Cys Asp Gly Pro 

130 135 140 

Met Tyr Gly His Asn Pro Ala He He He Gly Gly Gly Asn Ser Ala 
145 150 155 ' 160 

Val Glu Glu Gly Thr Phe Leu Ser Ser He Ala Ser Lys Val Tyr Val 

165 170 175 

He Val Arg Asp Ser Asp Phe He Ala Glu Lys Ala Leu Val Asn Asp 

180 185 190 

Leu Lys Ser Arg Lys Asn He Glu Val Leu Phe Asn Ala Ser Val Lys 

195 200 205 

Glu Leu His Gly Lys Asp Ala Leu Glu Tyr Ala He Val Asn His Asn 

210 215 220 

Gly Lys Glu Val Lys Leu Glu Val Ala Ser Leu Phe Pro Tyr He Gly 
225 230 235 240 

Phe Leu Pro Ser Ala Glu Tyr Ala Lys Asn Ala Gly Val Leu Glu Pro 
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245 250 255 

Asn Gly Phe lie Lys Thr Asp Glu Phe Met Glu Thr Lys Val Pro Gly 

260 ~ 265 270 

lie Tyr Ala lie Gly Asp lie Arg lie Lys Asp lie Arg Gin lie Leu 

275 280 ~ 285 

Thr Ala Thr Ser Asp Gly Thr lie Ala Gly Lys lie Leu Thr Asn Arg 

290 295 300 

lie Lys Lys 
305 



<210> 259 
<211> 316 
<212> PRT 

<213> Neisseria meningitidis 
<400> 259 

Met Ser Gin His Arg Lys Leu lie lie Leu Gly Ser Gly Pro Ala Gly 

15 10 15 

Tyr Thr Ala Ala Val Tyr Ala Ala Arg Ala Asn Leu Asn Pro Val lie 

20 " 25 30 

lie Thr Gly lie Ala Gin Gly Gly Gin Leu Met Thr Thr Thr Glu Val 

35 40 45 

Asp Asn Trp Pro Ala Asp Ala Asp Gly Val Gin Gly Thr Glu Leu Met 

50 55 60 

Ala Arg Phe Leu Ala His Ala Glu Arg Phe Gly Thr Glu He lie Phe 
65 70 75 80 

Asp Gin He Asn Ala Val Asp Leu Gin Lys Arg Pro Phe Thr Leu Lys 

85 90 95 

Gly Asp Met Gly Glu Tyr Thr Cys Asp Ala Leu He Val Ala Thr Gly 

100 105 110 

Ala Ser Ala Lys Tyr Leu Gly Leu Pro Ser Glu Glu Ala Phe Ala Gly 

115 120 125 

Lys Gly Val Ser Ala Cys Ala Thr Cys Asp Gly Phe Phe Tyr Lys Asn 

130 135 140 

Gin Asp Val Ala Val Val Gly Gly Gly Asn Thr Ala Val Glu Glu Ala 
145 150 155 160 

Leu' Tyr Leu Ala Asn He Ala Lys Thr Val Thr Leu He His Arg Arg 

165 ' 170 175 

Ser Glu Phe Arg Ala Glu Lys He Met He Asp Lys Leu Met Lys Arg 

180 ~ 185 190 

Val Glu Glu Gly Lys He He Leu Lys Leu Glu Ser Asn Leu Gin Glu 

195 200 ~ 205 

Val Leu Gly Asp Asp Arg Gly Val Asn Gly Ala Leu Leu Lys Asn Asn 

210 215 220 

Asp Gly Ser Glu Gin Gin He Ala Val Ser Gly He Phe He Ala He 
225 230 235 240 

Gly His Lys Pro Asn Thr Asp He Phe Lys Gly Gin Leu Glu Met Asp 

245 " 250 255 

Glu Ala Gly Tyr Leu Lys Thr Lys Gly Gly Thr Ala Asp Asn Val Gly 

260 265 270 

Ala Thr Asn He Glu Gly Val Trp Ala Ala Gly Asp Val Lys Asp His 

275 280 285 

Thr Tyr Arg Gin Ala He Thr Ser Ala Ala Ser Gly Cys Gin Ala Ala 

290 295 300 

Leu Asp Ala Glu Arg Trp Leu Gly Ser Gin Asn He 
305 310' 315 



<210> 260 
<211> 316 
<212> PRT 

<213> Neisseria meningitidis 
<400> 260 

Met Ser Gin His Arg Lys Leu He He Leu Gly Ser Gly Pro Ala Gly 
15 10 15 
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Tyr Thr 

lie Thr 

Asp Asn 
50 

Ala Arg 
65 

Asp Gin 

Gly Asp 

Ala Ser 

Lys Gly 
130 
Gin Asp 
145 

Leu Tyr 

Ser Glu 

Val Glu 

Val Leu 
210 
Asp Gly 
225 

Gly His 

Glu Ala 

Ala Thr 

Thr Tyr 
290 
Leu Asp 
305 



Ala Ala 

20 
Gly lie 
35 

Trp Pro 

Phe Leu 

lie Asn 

Met Gly 
100 
Ala Lys 
115 

Val Ser 

Val Ala 

Leu Ala 

Phe Arg 
180 
Glu Gly 
195 

Gly Asp 

Ser Glu 

Lys Pro 

Gly Tyr 
260 
Asn lie 
275 

Arg Gin 
Ala Glu 



Val Tyr Ala Ala 
Ala Gin 
Ala Asp 



Ala His 

70 
Ala Val 
85 

Glu Tyr 

Tyr Leu 

Ala Cys 

Val Val 
150 
Asn lie 
165 

Ala Glu 

Lys lie 

Asp Arg 

Gin Gin 
230 
Asn Thr 
245 

Leu Lys 

Glu Gly. 

Ala lie 

Arg Trp 
310 



Gly Gly 

40 
Ala Asp 
55 

Ala Glu 



Asp Leu 

Thr Cys 

Gly Leu 
120 
Ala Thr 
135 

Gly Gly 

Ala Lys 

Lys lie 

lie Leu 
200 
Gly Val 
215 

He Ala 

Asp He 

Thr Lys 

Val Trp 
280 
Thr Ser 
295 

Leu Gly 



Arg Ala 
25 

Gin Leu 

Gly Val 

Arg Phe 

Gin Lys 
90 

Asp Ala 
105 

Pro Ser 

Cys Asp 

Gly Asn 

Thr Val 
170 
Met He 
185 

Lys Leu 

Asn Gly 

Val Ser 

Phe Lys 
250 
Gly Gly 
265 

Ala Ala 
Ala Ala 
Ser Gin 



Asn Leu 

Met Thr 

Gin Gly 

60 
Gly Thr 
75 

Arg Pro 

Leu He 

Glu Glu 

Gly Phe 
140 
Thr Ala 
155 

Thr Leu 

Asp Lys 

Glu Ser 

Ala Leu 
220 
Gly He 
235 

Gly Gin 

Thr Ala 

Gly Asp 

Ser Gly 
300 
Asn He 
315 



Asn Pro 

30 
Thr Thr 
45 

Pro Glu 

Glu He 

Phe Thr 

Val Ala 
110 
Ala Phe 
125 

Phe Tyr 

Val Glu 

He His 

Leu Met 
190 
Asn Leu 
205 

Leu Lys 

Phe He 

Leu Glu 

Asp Asn 
270 
Val Lys 
285 

Cys Gin 



Val He 

Glu Val 

Leu Met 

He Phe 

80 
Leu Lys 
95 

Thr Gly 

Ala Gly 

Lys Asn 

Glu Ala 
160 
Arg Arg 
175 

Lys Arg 

Gin Glu 

Asn Asn 

Ala He 
240 
Met Asp 
255 

Val Gly 
Asp His 
Ala Ala 



<210> 261 

<211> 316 

<212> PRT 

<213> Pseudomonas aeruginosa 



<400> 261 



Met 


Ser 


Glu 


Val 


Lys 


His 


Ser 


Arg Leu lie He Leu Gly Ser Gly Pro 


1 








5 






10 15 


Ala 


Gly 


Tyr 


Thr 


Ala 


Ma 


Val 


Tyr Ala Ala Arg Ala Asn Leu Lys Pro 








20 








25 30 


Val 


Val 


He 


Thr 


Gly 


He 


Gin 


Pro Gly Gly Gin Leu Thr Thr Thr Thr 






35 










40 45 


Glu 


Val 


Asp 


Asn 


Trp 


Pro 


Gly 


Asp Val Glu Gly Leu Thr Gly Pro Ala 




50 










55 


60 


Leu 


Met 


Thr 


Arg 


Met 


Gin 


Gin 


His Ala Glu Arg Phe Asp Thr Glu He 


65 










70 




75 - 80 


Val 


Tyr 


Asp 


His 


He 


His 


Thr 


Ala Glu Leu Gin Gin Arg Pro Phe Thr 










85 






90 95 


Leu 


Lys 


Gly 


Asp 


Ser 


Gly 


Thr 


Tyr Thr Cys Asp Ala Leu He He Ala 








100 








105 110 


Thr 


Gly 


Ala 


Ser 


Ala 


Gin 


Tyr 


Leu Gly Met Ser Ser Glu Glu Ala Phe 






115 










120 125 


Met 


Gly 


Lys 


Gly 


Val 


Ser 


Ala 


Cys Ala Thr Cys Asp Gly Phe Phe Tyr 




130 










135 


140 


Arg 


Asn 


Gin 


Val 


Val 


Cys 


Val 


Val Gly Gly Gly Asn Thr Ala Val Glu 


145 










150 




155 160 


Glu 


Ala 


Leu 


Tyr 


Leu 


Ala 


Asn 


He Ala Lys Glu Val His Leu He His 



-154- 



WO 02/50289 



PCT/US01/50240 



165 170 175 

Arg Arg Asp Lys Leu Arg Ser Glu Lye lie Leu Gin Asp Lys Leu Phe 

180 185 190 

Asp Lys Ala Glu Asn Glv Asn Val His Leu His Trp Asn Thr Thr.Leu 

195 200 205 

Asp Glu Val Leu Gly Asp Ala Ser Gly Val Thr Gly Val Arg Leu Lys 

210 215 220 

Ser Thr lie Asp Gly Ser Thr Ser Glu Leu Ser Leu Ala Gly Val Phe 
225 230 235 240 

lie Ala lie Gly His Lys Pro Asn Thr Asp Leu Phe Gin Gly Gin Leu 

245 250 255 

Glu Met Arg Asp Gly Tyr Leu Arg lie His Gly Gly Ser Glu Gly Asn 

260 265 270 

Ala Thr Gin Thr Ser lie Glu Gly Val Phe Ala Ala Gly Asp Val Ala 

275 280 285 

Asp His Val Tyr Arg Gin Ala He Thr Ser Ala Gly Ala Gly Cys Met 

290 295 300 

Ala Ala Leu Asp Ala Glu Lys Tyr Leu Asp Asp His 
305 310 315 



<210> 262 
<211> 316 
<212> PRT 

<213> Pseudomonas aeruginosa 



<400> 262 



1 T 1G I— 


rxu 


A en 


liii 


Leu 


Arg 


Hie: 


Al o T\ t*/^t \7ra "1 Tl p T1 a T.an fill ir Got" CI"] -ir 

/ixa ru.^ v ct j. lie lie xtcu \j±y oci uxy 


1 








q 






iU ID 


Pro 


Ala 


Glv 


Tvr 
xyj. 


Ser 


Ala 


Ala 


VaI Tvt Al a Ala At*ct Rl a Actn Tien T»\rc 
v cix xyi rxxci nxa /u.y nia noil xicu uy a 








9ft 








9^ *5ft 


IrX \J 




i-JCU 


Tie 

X xc 


X11X. 






v7iii t\±cx uxy oxy vjxix xicu xixx xxix xixx 
















Aft AC 


Thr 


Glu 


Val 


Asp 


Asn 


Trp 


Pro 


Gly Asp Pro His Gly Leu Thr Gly Pro 




50 










55 


60 


Ala 


Leu 


Met 


Gin 


Arg 


Met 


Gin 


Glu His Ala Glu Arg Phe Glu Thr Glu 


65 










70 




75 80 


He 


Val 


Phe 


Asp 


His 


He 


His 


Ala Val Asp Leu Ala Gly Lys Pro Phe 










85 






90 ~ 95 


Thr 


Leu 


Arg 


Gly 


Asp 


Asn 


Gly 


Thr Tyr Thr Cys Asp Ala Leu He Val 








100 








105 110 


Ala 


Thr 


Gly 


Ala 


Ser 


Ala 


Arg 


Tyr Leu Gly Leu Pro Ser Glu Gin Ala 






115 










120 125 


Phe 


Met 


Gly 


Lys 


Gly 


Val 


Ser 


Ala Cys Ala Thr Cys Asp Gly Phe Phe 




130 










135 


140 


Tyr 


Arg 


Asn 


Arg 


Glu 


Val 


Ala 


Val He Gly Gly Gly Asn Thr Ala Val 


145 










150 




155 160 


Glu 


Glu 


Ala 


Leu 


Tyr 


Leu 


Ala 


Asn He Ala Ser Arg Val Thr Leu Val 










165 






170 175 


His 


Arg 


Arg 


Glu 


Thr 


Phe 


Arg 


Ala Glu Lys He Leu Gin Asp Lys Leu 








180 








185 190 


Gin 


Ala 


Arg 


Val 


Ala 


Glu 


Gly 


Lys He Val Leu Lys Leu Asn Ala Glu 






195 










200 205 


Val 


Asp 


Glu 


Val 


Leu 


Gly 


Asp 


Thr Met Gly Val Thr Gly Val Arg Leu 




210 










215 


220 


Lys 


Thr 


Arg 


Asp 


Gly 


Gly 


Ser 


Glu Glu He Ala Val Asp Gly Met Phe 


225 










230 




235 240 


Val 


Ala 


He 


Gly 


His 


Thr 


Pro 


Asn Thr Ser Leu Phe Glu Gly Gin Leu 








245 






250 255 


Ala 


Leu 


Lys 


Asp 


Gly 


Tyr 


Leu 


Val Val Asn Gly Gly Arg Glu Gly Asn 








260 








265 270 


Ala 


Thr 


Ala 


Thr 


Asn 


Val 


Pro 


Gly Val Phe Ala Ala Gly Asp Val Ala 






275 










280 285 


Asp 


His 


Val 


Tyr 


Arg 


Gin 


Ala 


He Thr Ser Ala Gly Ala Gly Cys Met 




290 










295 


300 


Ala 


Ala 


Leu 


Asp 


Val 


Glu 


Arg 


Tyr Leu Asp Ser Leu 


305 










310 




315 
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<210> 263 
<211> 345 
<212> PRT 

<213> Pyrococcus abyssi 



<400> 263 



Met Leu Leu 


Asn 


He 


His 


Gin Glu 


Ser Tyr Val 


Glu Val Val Lys 


Met 


1 




K 






10 


15 




Phe Sear Leu 


Gly 


Glv 


Leu 


Gly Lys 


Ser Arg Val 


Asp Glu Ser Lys 


Val 




20 








25 


30 




Tttd Asr> Val 


He 


He 


He 


Gly Ala 


Glv Pro Ala 


Gly Tyr Thr Ala 


Ala 


35 








40 




45 




lie Tyr Ala 


Ala 


Axq 


Phe 


Gly Leu 


Asp Thr He 


He He Thr Lys 


Asp 


50 








55 




60 




Leu Glv Glv 


Asn 


Met 


Ala 


He Thr 


Asn Leu He 


Glu Asn Tyr Pro 


Glv 


65 










75 




80 


Phe Pro Glu 


Glv 


He 


Ser 


Gly Ser 


Glu Leu Ala 


Lys Arg Met Tyr 


Glu 






85 






90 


95 




His Val Lvs 


Lys 


Tvr 

JT 


Glv 


Val Asn 


Val lie Phe 


Asp Glu Val Val 


Arcr 




i nn 








i ft*; 


XX \J 




Tie Art* Pro 


Ala 


Glu 




Ala Tyr 


TVr Glu Glv 

JL Jf X V3JLU 


Pro CVfi Gin Phe 


Glu 


XX3 








ion 

X \J 




IOC 

X £l J 




Val Tivs Thr 


Ala 


Asn 


Glv 


Tivs nil 11 


Tvr TiVR Glv 


Tivs Thr Tie Tie 


He 


1JU 








Ijj 




x*± U 




Ala Val Glv 


Ala 


Glu 


Pro 


Arg Lys 


Tien TT*Ls Val 


Pro Glv Glu Lvs 


Glu 


1 / C 










1 




1 fin 

ID U 


Phe Tin* Glv 




Glv 


Val 

v ax 


Ser Tyr 


Pvs Ala Thr* 


Cvs Asn Glv Pro 


Leu 






1 fie: 






x / u 


JL /O 




Phe Val Glv 

iSMV VUiL VJJ-Jr 


Lys 


Glu 


Val 


He Val 


Val Glv Glv 


Glv Asn Thr Ala 


Leu 




1 PO 








1 


1 Of) 




VjXII VJJL U. nld 


Leu 


Tyr 


Leu 


nib ocx 


T 1 f2l \r Va 1 
lie vJJ-y Val 


Ti^rc: Val TVir T.»n 
Xjyo val JL 1 IX ucu 


Va 1 
Val 


















X1J.0 AIM flL^ 




x>y o 






7\ o y"» T ,\rc* Tip 
no^J XJjr S3 X X c 


T.cai i fil n ZVptn ZXrcr 


Leu 


a JL V 


















Gly 


Tl e 

X XG 


Prr> 
irx 


Thr Tip 

-L XIX 1JLC 


T .pi i Ren T ri r 


Val Val Thr Gin 

VAX V CtX llli. wX U 


Tl e 

X xc 


225 










^ j j 




240 


Arg Gly Thr 


Asn 


Lys 


Val 


Glu Ser 


Val Val Leu 


Lys Asn Val Lys 


Thr 






245 






250 


255 




Gly Glu Thr 


Phe 


Glu 


Lys 


Lys Val 


Asp Gly Val 


Phe He Phe He 


Gly 




260 








265 


270 




Tyr Glu Pro 


Lys 


Thr 


Asp 


Phe Val 


Lys His Leu 


Gly He Thr Asp 


Glu 


275 








280 




285 




Tyr Gly Tyr 


He 


Lys 


Val 


Asp Met 


Tyr Met Arg 


Thr Lys Val Pro 


Gly 


290 








295 




300 




He Phe Ala 


Ala 


Gly 


Asp 


He Thr 


Asn Val Phe 


Lys Gin He Ala 


Val 


305 




310 




315 




320 


Ala Val Gly 


Gin 


Gly 


Ala 


He Ala 


Ala Asn Ser 


Ala Lys Glu Phe 


He 






325 






330 


335 




Glu Ser Tip 


Asn 


Gly 


Lys 


Ser He 


Glu 








340 








345 







<210> 264 
<211> 334 
<212> PRT 

<213> Rickettsia prowazekii 
<400> 264 

Met Tyr Asn Thr Asp He. Val He 

1 5 
Phe Ala Val Phe Gin Ala Gly Met 
20 

Asp Ala Gin Glu Val He Gly Gly 

35 40 
Lys His He Tyr Asp He Pro Ala 
50 55 



He Gly Ser Gly Pro Val Gly Leu 

10 15 
Leu Gly Met Lys Cys His Val He 
25 30 
Gin Cys He Thr Leu Tyr Pro Glu 
45 

Tyr Pro Lys He Ala Ala Lys Glu 
60 
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Leu He Lys Gin Leu Glu Ser Gin Ala Ala Pro Phe Asn Pro Val Tyr 
65 70 75 80 

His Leu Asn Gin Gin Ala Thr Glu Leu Asn Lys His Asp Asp Phe Phe 

85 90 95 

Glu He Lys Thr Ser Lys Asn Thr Leu He Lys Ser Lys Val He He 

100 105 110 

He Ala Ala Gly Ala Gly Ala Phe Gly Pro Asn Lys Pro Pro He Ala 

115 120 ~ 125 

Asn He Glu Ala Phe Glu Gly Lys Ser He Phe Tyr Phe He Asn Asp 

130 135 140 

Lys Ser Lys Phe Leu Gly Lys Asn He Val Val Ala Gly Gly Gly Asp 
145 150 155 160 

Ser Ala Val Asp Trp Ala He Thr Leu Ser Glu He Ala Asn Lys He 

165 170 175 

Tyr Leu Val His Arg Arg Asp Lys Phe Thr Ala Ala Thr Glu Ser Val 

180 185 190 

Arg Gin Leu Arg His He Ala Glu Thr Gly Lys He Glu Leu Val Thr 

195 200 205 

Gly Tyr Gin Leu Asn Asn Leu Asp Gly His Asn Ser Glu Leu Arg Ser 

210 215 220 

Val He Val Lys Asp Leu Gin Asn Asn He Arg Lys Leu Asp Ala Asn 
225 ~ 230 235 240 

He Leu Leu Pro Phe Phe Gly Leu Lys Gin Asp Leu Gly Pro Leu Ala 

245 " 250 255 

Asn Trp Gly Phe Asn Val Arg Leu Gin His He Glu Val Asp Asn Tyr 

260 265 270 

Tyr Tyr Gin Thr Asn He Lys Gly He Tyr Ala He Gly Asp Val Ala 

275 280 285 

His Tyr Val Gly Lys Leu Lys Leu He He Thr Gly Phe Ala Glu Ala 

290 295 300 

Ala Cys Ser Leu His His Ala Tyr Ser Arg Val Phe Asp Gly Lys Ala 
305 ~ 310 315 320 

Leu His Phe Glu Tyr Ser Thr Asn Lys Tyr Glu Gin Lys Gin 
325 330 



<210> 265 
<211> 311 
<212> PRT 

<213> Staphylococcus aureus 
<400> 265 

Met Thr Glu He Asp Phe Asp He Ala He He Gly Ala Gly Pro Ala 

1 5 10 15 

Gly Met Thr Ala Ala Val Tyr Ala Ser Arg Ala Asn Leu Lys Thr Val 

20 25 30 

Met He Glu Arg Gly He Pro Gly Gly Gin Met Ala Asn Thr Glu Glu 

35 40 45 

Val Glu Asn Phe Pro Gly Phe Glu Met He Thr Gly Pro Asp Leu Ser 

50 55 60 

Thr Lys Met Phe Glu His Ala Lys Lys Phe Gly Ala Val Tyr Gin Tyr 
65 ~ 70 ~ 75 80 

Gly Asp He Lys Ser Val Glu Asp Lys Gly Glu Tyr Lys Val He Asn 

85 90 95 

Phe Gly Asn Lys Glu Leu Thr Ala Lys Ala Val He He Ala Thr Gly 

100 105 110' 

Ala Gly Tyr Lys Lys He Gly Val Pro Gly Glu Gin Glu Leu Gly Gly 

115 120 125 

Arg Gly Val Ser Tyr Cys Ala Val Cys Asp Gly Ala Phe Phe Lys Asn 

130 " 135 140 

Lys Arg Leu Phe Val He Gly Gly Gly Asp Ser Ala Val Glu Glu Gly 
145 150 ~ 155 160 

Thr Phe Leu Thr Lys Phe Ala Asp Lys Val Thr He Val His Arg Arg 

165 170 175 

Asp Glu Leu Arg Ala Gin Arg He Leu Gin Asp Arg Ala Phe Lys Asn 

180 185 190 

Asp Lys He Asp Phe He Trp Ser His Thr Leu Lys Ser lie Asn Glu 
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195 






200 


Lys 


ASP 


Gly Lys Val 


Gly 


Ser 


Val 




210 






215 




Ser 


Glu 


Glu Thr His 


Glu 


Ala 


Asp 


225 






230 




Lys 


Pro 


Leu Thr Ala 


Pro 


Phe 


Lys 

2. 






245 






Gly 


Tyr 


He Val Thr 


Lys 


Asp 


Asp 






260 








Phe 


Ala 


Ala Gly Asp 


Val 


Arg 


Asp 






275 






280 


Ala 


Thr 


Gly Asp Gly 


Ser 


He 


Ala 




290 






295 




Glu 


His 


Leu Asn Asp 


Gin 


Ala 




305 




310 







205 

Thr Leu Thr Ser Thr Lys Asp Gly 
220 

Gly Val Phe lie Tyr He Gly Met 
235 240 
Asp Leu Gly He Thr Asn Asp Val 

250 255 
Met Thr Thr Ser Val Pro Gly He 
265 270 
Lys Gly Leu Arg Gin He Val Thr 
2 85 

Ala Gin Ser Thr Ser Gly Tyr He 
300 



<210> 266 
<211> 326 
<212> PRT 

<213> Streptomyces coelicolor 



<400> 266 

Met Ser Thr Ala Lys Asp Val Arg Asp Val He Val He Gly Ser Gly 

1 5 10 15 

Pro Ala Gly Tyr Thr Ala Ala Leu Tyr Thr Ala Arg Ala Ser Leu Asn 

20 25 30 

Pro Leu Val Phe Gly Gly Ala He Phe Val Gly Gly Ser Leu Thr Thr 

35 40 45 

Thr Thr Glu Val Glu Asn Phe Pro Gly Phe Pro Asp Gly Val Gin Gly 

50 55 60 

Pro. Glu Leu Met Glu Asn Met Arg Ala Gin Ala Glu Arg Phe Gly Ala 
65 70 75 80 

Glu Met Val Asp Asp Asp He Val Ala Val Asp Leu Thr Gly Asp Val 

85 90 95 

Lys Thr Val Thr Asp Thr Ala Gly Thr Val His Arg Ala Arg Thr Val 

100 105 110 

He Val Ala Thr Gly Ser Gly Tyr Arg Lys Leu Gly Val Pro Lys Glu 

115 120 125 

Asp Glu Leu Ser Gly Arg Gly Val Ser Trp Cys Ala Thr Cys Asp Gly 

130 135 140 

Phe Phe Phe Arg Asp Arg Asp He Val Val Val Gly Gly Gly Asp Thr 
145 150 " 155 ~ " 160 

Ala Met Glu Glu Ala Thr Phe Leu Thr Arg Phe Ala Arg Ser Val Thr 

. 165 170 175 

Val Val His Arg Arg Ser Ala Leu Arg Ala Ser Gin Val Met Gin Asn 

180 185 190 

Arg Ala Phe Ser Glu Asp Lys He Ser Leu Ala Phe Asp Ser Glu Val 

195 200 205 

Ala Thr Leu His Glu Glu Asn Gly Met Leu Ser Gly Met Thr Leu Arg 

210 215 220 

Asp Thr Leu Thr Gly Glu Thr Arg Glu Leu Ala Thr Thr Gly Leu Phe 
225 230 235 240 

He Ala He Gly His Asp Pro Arg Thr Glu Leu Phe Lys Gly Gin Leu 

245 250 255 

His Leu Asp Ser Glu Gly Tyr Leu Met Val Glu Ser Pro Ser' Thr Arg 

260 * " 265 270 

Thr Asn Val Pro Gly Val Phe Gly Ala Gly Asp Val Val Asp His Thr 

275 280 285 

Tyr Arg Gin Ala He Thr Ala Ala Ser Ser Gly Cys Ala Ala Ala Leu 

290 295 300 

Asp Ala Glu Arg Tyr Leu Ala Ala Arg Ser Asp Thr Ser Val Ser Ala 
305 310 315 320 

Glu Val Val Ala Val Ala 
325 



-158- 



WO 02/50289 



PCT/US01/50240 



<210> 267 

<211> 558 

<212> PRT 

<213> Streptomyces coelicolor 



<400> 267 



Met 


Ala 


Gin 


Ala 


Asp 


Gly 


Glu 


Thr Arg 


Thr 


Val 


He 


Met Thr 


Val Asp 


1 








5 






10 








15 


Asp 


Asp 


Pro 


Gly 


Val 


Ser 


Arq 


Ala Val 


Ala 


Arg 


Asp 


Leu Arg 


Arq Arq 






20 








25 








30 




Tvr 


Glv 


Ala 


Thr 


Tvr 


Arcr 


He 


Val Arg 


Ala 


Glu 


Ser 


Gly Glu 


Ser Ala 






35 










40 








45 




Leu 


Asn 


Ala 


Leu 


Ara 


Glu 


Leu 


Lys Leu 


Ara 


Gly 


Asp 


Leu Val 


Ala Val 




50 










55 








60 






lie 


Leu 


Ala 


Asd 


Tvr 


Ara 


Met 


Pro Gin 


Met 


Asn 


Gly 


lie Glu 


Phe Leu 


65 






70 








75 






80 


Glu 


Gin 


Ala 


Leu 


Asp 


Val 


Tvr 


Pro Gly 


Ala 


Ara 


Ara 


Val Leu 


Leu Thr 










85 








90 








95 


Ala 


Tvr 

J; 


Ala 


Aso 


Thr 


Asn 


Ala 


Ala He 


Asp 


Ala 


He 


Asn Val 


Val Asp 








100 








105 








110 




Leu 


Asp 


His 


Tvr 


Leu 


Leu 


Lys 


Pro Trp 


Asp 


Pro 


Pro 


Glu Glu 


Lys Leu 






115 










120 








125 




Tvr 


Pro 


Val 


Leu 


Asp 


Asp 


Leu 


Leu Gin 


Ala 


Tro 


Ara 


Ala Gly 


Asp His 




130 










135 








140 








Pro 


Val 


Pro 


Ser 


Thr 


Lys 


Val Val 


Glv 


His 


Ara 


Trp Ser 


Ala Arg 


145 










150 








155 






160 


Seir 


Ser 


Glu 


Val 


Arcr 


Glu 


Phe 


Leu Ala 


Ara 


Asn 


Gin 


Val Pro 


Tvr Arcr 










165 








170 








175 


Tm 

XT 


± y j - 


Ser 


Ser 


Asp 


Glu 


Pro 


Glu Gly 


Ara 


Arcr 


Leu 


Leu Ser 


Ala Ala 








180 








185 








190 




Glv 


Gln 


Asp 


Glv 


Gin 


Ara 


Leu 


Pro Val 


Val 


He 


Thr 


Pro Asp 


Gly Thr 






195 










200 








205 






Leu 


Val 


Glu 


Pro 


Glu 


Ala 


Pro Glu 


Leu 


Ala 


Ala 


Arg Val 


Glv Leu 




210 










215 








220 


Ala 


Thr 


Thr 


Pro 


Thr* 


Ser 


Asp 


Phe Tyr 


Asp 


Leu 


Val 


Val He 


Gly Gly 


225 










230 








235 






240 


Glv 


Pro 


Ala 


Glv 


Leu 


Glv 


Ala 


Ala Val 


Tvr 


Glv 


Ala 


Ser Glu 


Glv Leu 










245 








250 








255 


Ara 


Thr 


Val 


Leu 


Val 


Glu 


Ara 


Ser Ala 


Thr 


Glv 


Glv 


Gin Ala 


Gly Gin 








260 








265 








270 




Ser 


Ser 


Arg 


He 


Glu 


Asn 


Tvr 


Leu Glv 


Phe 


Pro 


Asp 


Gly Val 


Ser Gly 






275 










280 








285 




Glv 


Gin 


Leu 


Thr 


Glu 


Ara 


Ala 


Arg Arg 


Gin 


Ala 


Ala 


Arg Phe 


Gly Ala 




290 










295 








300 






Glu 


lie 


Leu 


Thr 


Ala 


Arcr 


Glu 


Val Thr 


Glv 


Leu 


Glu 


Ala Asn 


Gly Ala 


305 










310 






315 






320 


Ala 


Ara 


Val 


Val 


Arcr 


Phe 


Ser 


Asd Glv 


Ser 


Ala 


He 


Ala Ala 


His Ser 










325 








330 








335 


Val 


lie 


Leu 


Ala 


Thr 


Glv 


Val 


Ser Tyr 


Ara 


Gin 


Leu 


Thr Ala 


Pro Gly 








340 






345 








350 




Thr 


Glu 


Asp 


Leu 


Ala 


Glv 


Cvs 


Gly Val 


Phe 


Tvr 


Glv 


Ser Ala 


Leu Thr 






355 










360 








365 




Glu 


Ala 


Ala 


Ser 


Cys 


Gin 


Glv 


His Asp 


Val 


Tvr 


He 


Val Gly 


Gly Ala 




370 










375 








380 






Asn 


Ser 


Ala 


Gly 


Gin 


Ala 


Ala 


Met Tyr 


Leu 


Ala 


Arg 


Gly Ala 


Lys Ser 


385 










390 








395 






400 


Val 


Thr 


Leu 


Leu 


Val 


Arg 


Gly 


Gly Ser 


Leu 


Glu 


Ala 


Ser Met 


Ser Tyr 










405 




410 








415 


Tyr 


Leu 


He 


Gin 


Gin 


lie 


Glu 


Glu Thr- 


Pro 


Asn 


He 


Arg Val 


Arg Cys 






420 








425 








430 




Gly 


Thr 


Leu 


Val 


Glu 


Gly 


Ala 


His Gly 


Asp 


Gly 


His 


Leu Glu 


Arg Leu 




435 










440 








445 




Thr 


Leu 


Arg 


Asp 


Ala 


Ala 


Ser 


Gly Ala 


Thr 


Glu 


Leu 


Val Asp 


Ala Gin 




450 










455 






460 






Trp 


Leu 


Phe 


Val 


Phe 


He 


Gly 


Ala Ala 


Pro 


Leu 


Thr 


Asp Trp 


Leu Asp 


465 










470 








475 






480 


Gly 


Thr 


Val 


Leu 


Arg 


Asp 


Glu 


Arg Gly 


Phe 


He 


Leu 


Ala Gly 


Pro Asp 
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485 










490 






495 




Leu Thr Pro Asp Gly 


Arg 


Pro 


Pro 


Ala 


Gly Trp Glu 


Leu 


Asp 


Arg 


Pro 


500 






505 






510 






Pro Tyr His Leu Glu 


Thr 


Ser 


Val 


Pro 


Gly Val Phe 


Val 


Ala 


Gly 


Asp 


515 






520 






525 








Ala Arg Ala Glu Ser 


Ala Lys Arg Val 


Ala Ser Ala 


Val 


Gly 


Glu 


Gly 


530 




535 






540 










Ala Met Ala Val Met 


Leu 


Val 


His 


Arg 


Tyr Leu Glu 


Gin 


Ser 






545 


550 








555 











<210> 268 
<211> 303 
<212> PRT 

<213> Streptococcus pneumoniae 



<400> 268 






Met 


Tvr 


ZXerj Th*r" Tie 


Tin Tl« 

lie xxe 


r*l v 
vjxy 


i 

X 




C 

-J 








XJtSLL 


TSrf 7\ 1 -a Airs 
iyjL nXd 4"lXcl 


_ 

Arg oer 


- 

Asn 












Glv 


Leu 


Pro Glv Glv 

XTJ.W wjujr UXjr 


VaXii l Y ic;u 


7\ C3T1 










40 




vrxy 


iyi nlcX nlaii 


TT a Car 

lie DCl 


m v 




en 




DO 




Glu 


ir X U 


HCU \JlLl riaXl 


lieu vjxy 


V dX 


Dj 






*7ft 
/ U 




Apsti 


V CX X 


vixu rTkOjk^ nxs 


vixy A.sp 


Jrlie 






Q C 






X11X 


xyr 


XX XL r\J- y 


inr vai 


xxe 






i nn 






Pro 


Leu 


Gly Val Pro 


m ir m ii 
uiy biu 


bill 






115 




120 


Tyr 


Cys 


Ala Val Cys 


Asp Gly 


Ala 




130 




135 




Val 


Val 


Gly Gly Gly 


Asp Ser 


Ala 


145 






150 




Arg 


Phe 


Ala Lys Thr 


Val Thr 


He 






165 






Ala 


Gin 


Lys Val Leu 


Gin Asp 


Arg 






180 






Phe 


He 


Trp Asp Ser 


Val Val 


Arg 






195 




200 


Glu 


Ser 


Val Val Phe 


Glu Asn 


Val 




210 




215 




Ala 


Phe 


Gly Gly Val 


Phe He 


Tyr 


225 






230 




Phe 


Val 


Lys Glu Leu 


Asn He 


Gin 






245 






Asp 


Asn 


His Met Lys 


Thr Ala 


Val 




260 






Val 


Arg 


Leu Lys Asp 


Leu Arg 


Gin 






275 




280 


Ala 


He 


Ala Gly Gin 


Glu Ala 


Tyr 




290 




295 





Ala 


Gly Pro Ala 


Gly 


Met 


Thr 


Ala 




10 






15 




Leu 


Lvs Val Ala 


Leu 


He 


Glu 


Glv 


25 






30 






Asn 


Thr Ser Asp 


He 


Glu 


Asn 


Tvr 






45 








Pro 


Glu Leu Ala 


Glu 


Lvs 


Met 


Phe 












Glu 


His He Tyr 


Glv 


Tvr 


Val 


Glu 














Lys 


Lvs Val Met 


Thr 


Asp 


Asp 


Gin 














Val 


Ala Thr Gly 


Ser 


Lys 


His 


Arg 


105 






110 






Glu 


Leu Asn Ser 


Arg 


Gly 


Val 


Ser 






125 








Phe 


Phe Arg Asp 


Gin 


Asp 


Leu 


Leu 




140 










Val 


Glu Glu Ala 


Leu 


Phe 


Leu 


Thr 




155 








160 


Val 


His Arg Arg 


Asp 


Gin 


Leu 


Arg 




170 






175 




Ala 


Phe Ala Asn 


Glu 


Lys 


He 


Ser 


185 






190 






Glu 


He Lys Gly 


Glu 


Asn 


Arg 


Val 






205 








Lys 


Thr Gly Gin 


Val 


Thr 


Glu 


Gin 


220 










Val 


Gly Leu Asp 


Pro 


Leu 


Ser 


Asp 




235 








240 


Asp 


Gin Ala Gly 


Trp 


He 


Val 


Thr 




250 






255 




Asp 


Gly He Phe 


Ala 


Val 


Gly 


Asp 


265 






270 






Val 


Thr Thr Ala 


Val 


Gly 


Asp 


Gly 






285 








Lys 


Phe lie Thr 


Glu 


His 


Ser 





300 



<210> 269 
<211> 330 
<212> PRT 

<213> Streptococcus pyogenes 
<400> 269 

Met Lys Asp Lys Ala Tyr Asp He 

1 5 
Gly Leu Phe Ala Ala Phe Tyr Ala 
20 



Thr He He Gly Gly Gly Pro He 

10 15 
Gly Leu Arg Gly Val Thr Val Lys 
25 30 
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lie lie 


Glu 


Ser Leu 




35 




Pro Glu 


Lys 


Met He 


50 




Val Glu 


Leu 


Thr Glu 


65 






Arg Thr 


Thr 


He Cys 






85 


Lys Gly 


Gly 


Phe Ser 






100 


Ala lie 


lie 


He Ala 




115 




Gly Leu 


Glu 


Ser Glu 


130 






Val His 


Gin 


Leu Asp 


145 






Gly Gly 


Asp 


Ser Ala 






165 


Glu Ser 


Val 


Thr Val 






180 


His Ser 


Val 


Glu Leu 




195 




Tyr Val 


Pro 


Lys Ala 


210 






Val lie 


Gin 


Lys Val 


225 






Ser Leu 


He 


Val Ser 






245 


Asn Trp 


Asn 


Leu Asp 






260 


Phe Gin 


Thr 


Ser Gin 




275 




Tyr Asn 


Gly 


Lys Val 


290 






Thr Ala 


Val 


Asn Gin 


305 






Arg Val 


Val 


His Ser 




325 



Ser 


Glu 


Leu Gly Gly Gin 






40 


Tyr 


Asp 


He Pro Ala Tyr 




55 




Asn 


Leu 


He Lys Gin Leu 


70 




75 


Leu 


Lys 


Glu Glu Val Leu 




90 


He 


Arg 


Thr Asn Lys Ala 






105 


Cys 


Gly 


Asn Gly Ala Phe 






120 


Glu 


Asn 


Phe Ala Asp His 




135 


Gin 


Phe 


Ala Gly Gin Lys 


150 




155 


Val 


Asp 


Trp Ala Leu Ala 






170 


Val 


His 


Arg Arg Asp Ala 






185 


Leu 


Lys 


Ala Ser Thr Val 






200 


Leu 


Lys 

JL 


Gly He Gly Asn 




215 




Lys 


Glu 


Asp Glu Val Leu 


230 




235 


Phe 


Gly 


Phe Ser Thr Ser 




250 


Tyr 


Lys 


Arg Ser Ser He 






265 


Glu 


Gly 


He Phe Ala He 






280 


Asp 


Leu 


He Ala Thr Gly 




295 




Ala 


He 


Asn Tyr He Tyr 


310 




315 


Thr 


Ser 


Leu He Asp 



330 



Pro Ala He Leu Tyr 
45 

Pro Ser Leu Thr Gly 
60 

Ser Arg Phe Glu Asp 
80 

Thr Phe Asp Lys Val 
95 

Glu His Phe Ser Lys 
110 

Ala Pro Arg Thr Leu 
125 

Asn Leu Phe Tyr Asn 
140 

Val Val He Cys Gly 
160 

Leu Glu Asp He Ala 
175 

Phe Arg Ala His Glu 
190 

Asn Leu Leu Thr Pro 
205 

Leu Ala Glu Lys Leu 
220 

Glu Leu Glu Leu Asp 
240 

Asn Lys Asn Leu Lys 
255 

Thr Val Ser Pro Leu 
270 

Gly Asp Ala Ala Ala 
285 

Phe Gly Glu Ala Pro 
300 

Pro Asp Arg Asp Asn 
320 



<210> 270 

<211> 325 

<212> PRT 

<213> Sulfolobus solfataricus 



<400> 270 



Met 


Pro 


Leu 


Lys 


Thr 


Tyr 


Asp 


Thr He He Val Gly Ala Gly He Ala 


1 








5 






10 15 


Gly 


Leu 


Ser 


Ala 


Ala 


Leu 


Tyr 


Ser Ser Arg Gin Lys Leu Ser Thr Leu 








20 








25 30 


Val 


Leu 


Ser 


Lys 


Asp 


Leu 


Gly 


Gly Gin Leu Thr Leu Thr Asp Leu He 






35 










40 45 


Glu 


Asn 


Tyr 


Pro 


Gly 


He 


Glu 


Ser Thr Gly Gly Leu Thr Leu Ala Gin 




50 










55 


60 


Lys 


He 


Glu 


Lys 


Gin 


Ala 


Lys 


Lys Phe Gly Ala Glu Phe He Tyr Gly 


65 










70 




75 '80 


Glu 


Glu 


Val 


Lys 


Glu 


He 


Ala 


Gin Glu Ser Asp Leu Phe He He Lys 










85 






90 95 


Gly 


He 


Lys 


Gly 


Glu 


Tyr 


Ala 


Gly Arg Ala Leu He Leu Ala Phe Gly 








100 








105 110 


Lys 


Thr 


Pro 


Arg 


Glu 


He 


Asn 


Val Pro Gly Glu Gin Glu Phe Lys Gly 




115 










120 125 


Lys 


Gly 


Val 


Ser 


Tyr 


Cys- 


Ala 


He Cys Asp Ala Ala Phe Phe Lys Gly 




130 










135 


140 


Lys 


Pro 


Ala 


Ala 


Val 


He 


Gly 


Glu Gly Glu Pro Gly He Glu Ala He 


145 










150 




155 160 


Glu 


Leu 


Leu 


Ser 


Asn 


Tyr 


Ala 


Asn Pro Ala Tyr Tyr He Thr Ser Ser 
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165 



Ser 


Tyr 


Leu Ala 


Gly 


Glu 


Glu 


Glu 






180 










Pro. 


Thr 


Val 


Lvs 


lie 


Leu 


Thr 


Ser 






195 








200 


Asn 


Ser 


Lvs 


Val 


Glu 


Glu 


Leu 


Val 




210 








215 




Gin 


Leu 


LVS 


Val 


Asp 


Gly 


Val 


lie 


225 










230 






Thr 


Glu 


Phe 


Leu 


Lys 
245 


Gly 


Phe 


Val 


lie 


Val 


Asp 


Glu 
260 


Leu 


Gly 


Arg 


Thr 


Gly 


Asp 


Val 
275 


Thr 


Gin 


Thr 


Pro 


Tyr 
280 


Glu 


Gly 
290 


Val 


Lys 


Ala 


Ala 


Leu 
295 


Ser 


Arg 


Gly 


Leu 


Pro 


Pro 


Val 


Thr 


Val 


305 










310 






Val 


Ser 


Phe Arg 


Leu 









325 



<210> 271 
<211> 323 
<212> PRT 

<213> Sulfolobus solfataricus 



<400> 271 



Met 


Ser 


Leu 


Leu 


Pro 


Atq 


Thr 


Thr 


1 








5 








Asp 


Val 


He 


He 
20 


Val 


Gly 


Leu 


Gly 


Tyr 


Ser 


Ala 


Arg 


Tyr 


Met 


Leu 


Lys 






35 










40 


Gly 


Gly 


Gin 


Leu 


Thr 


Glu 


Ala 


Gly 




50 










55 




lie 


Glu 


He 


Gin 


Ala 


Ser 


Asp 


Met 


65 










70 






Glu 


Lys 


Tyr 


Glu 


Val 


Pro 


Val 


Leu 










85 








Asn 


Arg 


Gly 


Asp 


Glu 


Phe 


Val 


Val 








100 










Lys 


Ala 


Asp 


Ser 


Val 


He 


Leu 


Gly 






115 










120 


Gly 


Val 


Pro 


Gly 


Glu 


Gin 


Glu 


Phe 




130 










135 




Ser 


Val 


Cys 


Asp 


Ala 


Pro 


Leu 


Phe 


145 










150 






Gly 


Gly 


Gly 


Asp 


Ser 


Ala 


Leu 


Glu 










165 








Ser 


Thr 


Lys 


Val 


Tyr 


Leu 


He 


His 






180 










Pro 


He 


Tyr 


Val 


Glu 


Thr 


Val 


Lys 






195 










200 


Leu 


Asn 


Ser 


Val 


Val 


Lys 


Glu 


He 




210 










215 




Val 


Val 


Val 


Glu 


Asn 


Leu 


Lys 


Thr 


225 










230 






Asn 


Gly 


Val 


Phe 


He 


Glu 


He 


Gly 










245 








Lys 


Ser 


Asn 


Gly 


He 


Glu 


Thr 


Asp 








260 










Glu 


Trp 


Met 


Arg 


Thr 


Ser 


Val 


Pro 






275 










280 


Thr 


Ser 


Ala 


Trp 


Leu 


Gly 


Phe 


Arg 




290 










295 





170 175 



He Val 


Lys 


Asn Val 


Val Asn Lys 


185 




190 




Ser Arg 


Val 


Leu Glu 


He Arg Gly 






205 






He Lys 


Arg 


Gly Asp 


Glu He 


Leu 






220 






He Glu 


Met 


Gly Tyr 


Thr Leu 


Lys 




235 






240 


Glu Leu 


Asn 


Glu Lys 


Gly Glu 


He 


250 






255 




Ser Arg 


Glu 


Gly Val 


Phe Ala 


Ala 


265 






270 




Lys Gin 


Ala 


Val Val 


Ala Ala 


Ala 






285 






Ala Tyr 


Asn 


Tyr He 


Arg Ser 


Lys 






300 






Asp Trp 


Lys 


Ala Glu 


Lys Lys 


Lys 




315 






320 


Ser Val 


• 

Lys 


Pro Gly 


Glu Lys 


Phe 


10 






15 




Pro Ala 


Ala 


Tyr Gly 


Ala Ala 


Leu 


25 




30 




Thr Leu 


Val 


He Gly 


Glu Thr 


Pro 






45 






He Val 


Asp 


Asp Tyr 


Leu Gly Leu 






60 






He Lys 


Val 


Phe Asn 


Lys His 


He 




75 






80 


Leu Asp 


He 


Val Glu 


Lys He 


Glu 


90 






95 




Lys Thr 


Lys 


Arg Lys 


Gly Glu 


Phe 


105 






110 




He Gly 


Val 


Lys Arg 


Arg Lys 


Leu 






125 






Ala Gly 


Arg 


Gly He 


Ser Tyr 


Cys 






140 






Lys Asn 


Arg 


Val Val 


Ala Val 


He 


155 






160 


Gly Ala 


Glu 


He Leu 


Ser Ser Tyr 


170 






175 




Arg Arg 


Asp 


Thr Phe 


Lys Ala 


Gin 


185 






190 




Lys Lys 


Pro 


Asn Val 


Glu Phe 


Val 




205 






Lys Gly 


Asp 


Lys Val 


Val' Lys Gin 






220 






Gly Glu 


He 


Lys Glu 


Leu Asn 


Val 




235 






240 


Phe Asp 


Pro 


Pro Thr 


Asp Phe Ala 


250 






255 




Thr Asn 


Gly 


Tyr He 


Lys Val Asp 


265 






270 




Gly Val 


Phe 


Ala Ala 


Gly Asp 


Cys 






285 






Gin Val 


He 


Thr Ala 


Val Ala 


Gin 






300 
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f 

Gly Ala Val Ala Ala Thr Ser Ala Tyr Arg Tyr Val Thr Glu Lys Lys 
305 310 315 320 

Gly Lys Lys 



<210> 272 
<211> 332 
<212> PRT 

<213> Sulfolobus solfataricus 
<400> 272 

Met Asp Glu Tyr Asp lie Val Val He Gly Gly Gly Pro Val Gly Leu 

15 10 15 

Phe Gly Thr Phe Tyr Ala Gly Leu Arg Asp Met Lys Thr Leu Leu He 

20 25 30 

Asp Ala Gin Asp Glu Leu Gly Gly Gin Leu Val Ser Leu Tyr Pro Glu 

35 40 45 

Lys He Val Tyr Asp Val Gly Gly Leu Ala Gly He Gin Ala Tyr Glu 

50 ~ 55 60 

Leu Ala Gin Arg Leu He Glu Gin Ala Lys Met Phe Gly Pro Asp He 
65 ~ 70 * 75 80 

Lys Val Asn Glu Leu Ala Asp Met He Glu Lys Thr Asn Asp Asn Met 

85 90 95 

Trp He Val Lys Thr Asp Lys Ala Thr Tyr Lys Thr Lys Thr He Phe 

100 105 110 

He Ala Ala Gly He Gly Lys He Val Pro Ser Arg Leu Gly Ala Lys 

115 * ~ " 120 125 

Gly Glu He Glu Tyr Glu Asn Arg Gly Val Tyr Tyr Thr Val Arg Arg 

130 135 140 

Lys Lys Asp Phe Glu Gly Lys Arg Val Leu He Val Gly Gly Gly Asp 
145 150 155 160 

Ser Ala Val Asp Trp Ala Leu Thr Leu Ala Pro Val Ala Lys Ser Val 

165 170 175 

Thr Leu He His Arg Arg Asp Gin Phe Arg Ala His Glu Arg Ser Val 

180 ~ 185 190 

Lys Glu Leu Phe Arg Val Ala Asn Val Tyr Val Trp His Glu Leu Lys 

195 200 205 

Glu Val Lys Gly Asp Gly Asn Lys Val Thr Gin Ala He He Phe Asp 

210 ~ ~ 215 220 

Asn Arg Thr Lys Glu Glu Lys Val Leu Asp Val Asp Ser Val He He 
225 230 ~ 235 ^ 240 

Ser He Gly Tyr Lys Gly Asp Leu Gly Asn He Pro Lys Trp Gly Val 

245 250 255 

Thr Met Lys Gly Arg Asp He Val Val Asn Gly Arg Met Glu Thr Asn 

260 265 " 270 

Leu Pro Gly Val Tyr Ala Gly Gly Asp He Val Gin Met Glu Gly Ser 

275 280 285 

Pro Lys Leu Ala Leu He Ala Val Gly Phe Ala His Ala Ala He Ala 

290 295 300 

He Ser Val Ala Lys Lys Tyr Val Glu Pro Asn Ala Ser Leu Phe Ala 
305 310 315 320 

Gly His Ser Ser Glu Met Asp Lys Phe Lys Pro Lys 
325 330 



<210> 273 
<211> 324 
<212> PRT 

<213> Rhizobium loti 



Pro Val Leu He He Gly Ser Gly Pro Ala 

10 15 
Tyr Ala Ala Arg Ala Met Leu Lys Pro Met 

25 ~ 30 
Gin Gly Gly Gin Leu Met He Thr Thr Asp 



<400> 273 

Met Thr Thr Lys His Ala 

1 * 5 

Gly Tyr Thr Ala Ala Val 
20 

Leu Val Ala Gly Leu Gin 
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35 




Val Gin 






50 






Met Glii 


Gin 


Mf*t M^t 

11C L. ric L. 


65 






Asn Asn 


lie 


lie Thr 






85 


Lys Gly 


Asp 


Ser Gly 






100 


Thr Gly 


Ala 


Gin Ala 


115 




Met Glv 


Phe 


Gly Val 


130 






Arg Gly 


Lys 


Asd Val 


145 






Gl ii AT a 


T i 








165 




QCI 


riDp ir lie 






180 


Gl n TjV«3 


Asp 


Asn Val 




195 




Thr Gly 


Arn 


Pro Gly 


210 






TjVS Tien 


Lys 


His Ala 


225 






Gly Val 


Phe 


Val Ala 








fill Tr T»xye 

vsj.y *-tys 


Lieu 


T,-i rQ Y"V 

ijys vjxh 






9 fin 

Z D U 


Ser Thr 


Arg 


Thr Asp 




275 




Asp Asp 


Val 


Tyr Arg 


290 






Ala Ala 


Leu 


Glu Ala 


305 






Glu Ala 


Ala 


Glu 



Gly Phe 

55 
Lys Gin 
70 

Glu Val 

Thr Thr 

Lys Trp 

Ser Ala 
135 
Ala Val 
150 

Ser Asn 

Arg Ala 

Arg Val 

Lys Ala 
215 
Val Thr 
230 

He Gly 

Lys Pro 

Val Pro 

Gin Ala 
295 
Glu Lys 
310 



40 

Ala Asp 

Ala £lu 

Asp Leu 

Tyr Thr 
105 
Leu Gly 
120 

Cys Ala 

Val Gly 

Leu Ala 

Glu Arg 
185 
He Trp 
200 

Pro Leu 

Gly Ala 

His Ala 

Asn Gly 
265 
Gly Val 
280 

Val Thr 
Tyr Leu 



Pro He 

His Val 

75 
Asn Val 
90 

Ala Asp 

He Pro 

Thr Cys 

Gly Gly 
155 
Lys Ser 
170 

He Leu 

Asp Thr 

Pro Pro 

Glu Thr 
235 
Pro Ala 
250 

Tyr Leu 

Phe Ala 

Ala Ala 

Ala Gly 
315 



45 

Gin Gly Pro 
60 

Gly Thr Asp 

Arg Pro Phe 

Ala Leu He 
110 

Thr Glu Gin 
125 

Asp Gly Phe 
140 

Asn Ser Ala 

Val Thr Val 

Arg Glu Arg 
190 

Val Val Asp 

205 
Ser Val Glu 
220 

His Leu Lys 

Val Glu Leu 

Trp Thr Ala 
270 

Ala Gly Asp 

285 
Gly Leu Gly 
300 

He Glu Val 



Trp Leu 

He He 
80 

Arg Ala 
95 

He Ala 

Asp Phe 

Phe Tyr 

Val Glu 
160 
He His 
175 

Leu Leu 

Glu He 

Gly Leu 

Val Asp 
240 
Phe Val 
255 

Pro Asn 

Val Thr 

Cys Met 

His Arg 
320 



<210> 274 

<211> 343 

<212> PRT 

<213> Rhizobium loti 



<400> 274 



Met 


Thr 


Gly 


He He 


Ser Thr Asp 


1 






5 


Val 


Gly 


Leu 


Phe Ala 


Val Phe Glu 








20 




His 


Leu 


He 


Asp He 


Leu Asp Lys 






35 




40 


Tyr 


Pro 


Glu 


Lys Pro 


He Tyr Asp 




50 






55 


Ala 


Gin 


Gly 


Leu Val 


Asp Lys Leu 


65 








70 


Pro 


Asp 


Phe 


Thr Tyr 


Asn Arg Met 








85 




Asp 


Gly 


Ser 


Phe Arg 


Val Thr Thr 








100 




Lys 


Val 


Val 


Val He 


Ala Ala Gly 






115 




120 


Pro 


Pro 


He 


Pro Gly 


He Glu Pro 




130 






135 


Ser 


Val 


Arg 


Arg Met 


Glu Asp Phe 


145 








150 


Gly 


Gly 


Gly 


Asp Ser 


Ala Leu Asp 








165 





Val Leu He Val Gly Ala Gly Pro 

10 15 
Leu Gly Leu Phe Asp Met Lys Cys 
25 30 
Pro Gly Gly Gin Cys Ala Glu Leu 
45 

He Pro Gly Trp Pro Ser He Ser 
60 

Leu Glu Gin He His Pro Phe Lys 

75 80 
Val Ser Ser Leu Glu Lys • Leu Glu 

90 95 
Asp Glu Asn Glu Val Phe Glu Ala 
105 110 
Gly Gly Ser Phe Gin Pro Lys Arg 
125 

Tyr Glu Gly Lys Ser Val Phe Tyr 
140 

Arg Gly His Asp Leu Val He Val 
155 160 
Trp Thr Leu Asn Leu Gin Pro Val 
170 175 



-164- 



WO 02/50289 



PCT/US01/50240 



Ala 


Lys 


Ser 


Val 


Thr 


Leu 


Val 


His 


Arg Arg Pro Glu Phe Arg Ala Ala 






180 










185 ~ 190 


Pro 


Asp 


Ser 
195 


Val 


Asn 


Lys 


Met 


Tyr 
200 


Ala Met Gin Glu Met Lys Gin Leu 
205 


Glu 


Phe 
210 


Arg 


Val 


Gly 


Gin 


Val 
215 


Thr 


Gly Leu Thr Gly Ala Asp Gly Gin 
220 


Leu 


Ser 


Ser 


Ala 


Thr 


He 


Lys 


Gly 


Gly Pro Asp Gly Asp He Glu Val 


225 










230 






235 240 


Pro 


Cvs 


Thr 


Arcr 


Met 


Leu 


Pro 


Phe 


Phe Gly Leu Thr Met Lys Leu Gly 






245 








250 255 


Pro 


He 


Ala 


Glu 


Trp 


Gly 


Leu 


Asn 


Leu His Glu Asn Leu He Pro Val 








260 






265 270 


Asp 


Thr 


Glu 


Lys 


Phe 


Gin 


Thr 


Ser- 


Val Pro Gly He Phe Ala Val Gly 






275 








280 


285 


Asp 


He 
290 


As n 


Ser 


Tyr 


Pro 


Gly 
295 


Lys 


Leu Lys Leu He Leu Ser Gly Phe 
300 


His 


Glu 


Val 


Ala 


Leu 


Met 


Ala 


Gin 


Ala Ala Lys Arg He Val Ser Pro 


305 










310 






315 320 


Gly 


Glu 


Arg 


He 


Val 


Phe 


Gin 


Tyr 


Thr Thr Ser Ser Thr Ser Leu Gin 








325 






330 335 


Lys 


Lys 


Leu 


Gly 
340 


Val 


Val 


Gly 







<210> 275 
<211> 15 
<212> PRT 

<213> Saccharomyces cerevisiae 
<220> 

<221> VARIANT 
<222> 9, 11 

<223> Xaa = Any Amino Acid 
<400> 275 

Val His Asn He Val Thr He He Xaa Ser Xaa Pro Ala Ala His 
15 10 15 



<210> 276 
<211> 104 
<212> PRT 

<213> Staphylococcus aureus 
<400> 276 

Met Ala He Val Lys Val Thr Asp Ala Asp Phe Asp Ser Lys Val Glu 

15 10 15 

Ser Gly Val Gin Leu Val Asp Phe Trp Ala Thr Trp Cys Gly Pro Cys 

20 25 30 

Lys Met He Ala Pro Val Leu Glu Glu Leu Ala Ala Asp Tyr Glu Gly 

35 40 45 

Lys Ala Asp He Leu Lys Leu Asp Val Asp Glu Asn Pro Ser Thr Ala 

50 55 60 

Ala Lys Tyr Glu Val Met Ser He Pro Thr Leu He Val Phe Lys Asp 
65 70 75 80 

Gly Gin Pro Val Asp Lys Val Val Gly Phe Gin Pro Lys Glu Asn Leu 

85 " 90 95 

Ala Glu Val Leu Asp Lys His Leu 
100 



<210> 277 
<211> 92 
<212> PRT 

<213> Staphylococcus xylosus 
<400> 277 
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Met Ala Glu Gin Val Asp Phe Asp lie Ala lie He Gly Ala Gly Pro 

15 10 15 

Ala Gly Met Thr Ala Ala Val Tyr Ala Ser Arg Ala Asn Leu Ser Thr 

20 25 ~ 30 

Val Met He Glu Arg Gly Met Pro Gly Gly Gin Met Ala Asn Thr Glu 

35 40 45 

Glu Val Glu Asn Phe Pro Gly Phe Glu Met Val Thr Gly Pro Asp Leu 

50 55 60 

Ser Thr Lys Met Phe Glu His Ala Lys Lys Phe Gly Ala Lys Tyr Gin 
65 70 75 " 80 

Tyr Gly Asp He Lys Ser He Glu Asp Lys Gly Ser 
85 90 



<210> 278 
<211> 319 
<212> PRT 

<213> Thermoplasma acidophilum 
<400> 278 

Met Glu Phe Asn Leu His Ala Val Ser Ser Glu Glu Lys Glu Arg Asp 

15 10 ^ 15 

Phe Asp Val Val He Val Gly Ala Gly Ala Ala Gly Phe Ser Ala Ala 

20 25 30 

Val Tyr Ala Ala Arg Ser Gly Phe Ser Val Ala He Leu Asp Lys Ala 

35 40 45 

Val Ala Gly Gly Leu Thr Ala Glu Ala Pro Leu Val Glu Asn Tyr Leu 

50 55 60 

Gly Phe Lys Ser He Val Gly Ser Glu Leu Ala Lys Leu Phe Ala Asp 
65 70 75 80 

His Ala Ala Asn Tyr Ala Lys He Arg Glu Gly Val Glu Val Arg Ser 

85 90 95 

He Lys Lys Thr Gin Gly Gly Phe Asp He Glu Thr Asn Asp Asp Thr 

100 105 110 

Tyr His Ala Lys Tyr Val He He Thr Thr Gly Thr Thr His Lys His 

115 120 125 

Leu Gly Val Lys Gly Glu Ser Glu Tyr Phe Gly Lys Gly Thr Ser Tyr 

130 135 140 

Cys Ser Thr Cys Asp Gly Tyr Leu Phe Lys Gly Lys Arg Val Val Thr 
145 150 155 ~ 160 

lie Gly Gly Gly Asn Ser Gly Ala He Ala Ala He Ser Met Ser Glu 

165 170 175 

Tyr Val Lys Asn Val Thr lie lie Glu Tyr Met Pro Lys Tyr Met Cys 

180 185 190 

Glu Asn Ala Tyr Val Gin Glu He Lys Lys Arg Asn He Pro Tyr He 

195 200 205 

Met Asn Ala Gin Val Thr Glu He Val Gly Asp Gly Lys Lys Val Thr 

210 215 220 

Gly Val Lys Tyr Lys Asp Arg Thr Thr Gly Glu Glu Lys Leu lie Glu 
225 230 * 235 ~ 240 

Thr Asp Gly Val Phe He Tyr Val Gly Leu He Pro Gin Thr Ser Phe 

245 250 255 

Leu Lys Asp Ser Gly Val Lys Leu Asp Glu Arg Gly Tyr He Val Val 

260 265 270 

Asp Ser Arg Gin Arg Thr Ser Val Pro Gly Val Tyr Ala Ala Gly Asp 

275 280 285 

Val Thr Ser Gly Asn Phe Ala Gin He Ala Ser Ala Val Gly Asp Gly 

290 295 300 

Cys Lys Ala Ala Leu Ser Leu Tyr Ser Asp Ser He Ser Lys Lys 
305 310 315 



<210> 279 
<211> 317 
<212> PRT 

<213> Thermotoga maritima 
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<400> 279 

Met Val Phe Phe Asp Thr Gly Ser Leu Lys Lys Lys Glu lie Lys Asp 

1 5 10 15 

Lys Tyr Asp He Val Val Val Gly Gly Gly Pro Ala Gly Leu Thr Ser 

20 25 30 

Ala He Tyr Ala Arg Arg Ala Gly Leu Ser Val Leu Val Val Glu Lys 

35 40 45 

Ala He Glu Gly Gly Tyr Val Asn Leu Thr His Leu Val Glu Asn Tyr 

50 55 60 

Pro Gly Phe Pro Ala He Ser Gly Glu Glu Leu Ala Ser Lys Phe Lys 
65 70 75 " 80 

Glu His Ala Glu Lys Phe Gly Ala Asp He Tyr Asn Ala Glu Val Val 

85 90 95 

Lys Leu Glu Val Gin Gly Asp Lys Lys Val Val Glu Leu Asp Asp Gly 

100 105 110 

Lys Arg He Glu Ala Pro Val Val He Val Ala Thr Gly Ala Asn Pro 

115 120 125 

Lys Lys Leu Asn Val Pro Gly Glu Lys Glu Phe Phe Gly Lys Gly Val 

130 135 140 

Ser Tyr Cys Ala Thr Cys Asp Gly Tyr Leu Phe Ala Gly Lys Asp Val 
145 150 " * " 155 " 160 

He Val Val Gly Gly Gly Asp Ser Ala Cys Asp Glu Ser He Phe Leu 

165 170 175 ' 

Ser Asn He Val Asn Lys He Thr Met He Gin Leu Leu Glu Thr Leu 

180 185 190 

Thr Ala Ala Lys Val Leu Gin Glu Arg Val Leu Asn Asn Pro Lys He 

195 200 205 

Glu Val He Tyr Asn Ser Thr Val Arg Glu He Arg Gly Lys Asp Lys 

210 215 220 

Val Glu Glu Val Val He Glu Asn Val Lys Thr Gly Glu Thr Lys Val 
225 230 235 240 

Leu Lys Ala Asp Gly Val Phe He Phe He Gly Leu Asp Pro Asn Ser 

245 250 ~ 255 

Lys Leu Leu Glu Gly Leu Val Glu Leu Asp Pro Tyr Gly Tyr Val He 

260 265 270 

Thr Asp Glu Asn Met Glu Thr Ser Val Lys Gly He Tyr Ala Val Gly 

275 280 ^ 285 

Asp Val Arg Lys Lys Asn Leu Arg Gin He Val Thr Ala Val Ala Asp 

290 295 300 

Gly Ala lie Ala Val Glu His Ala Ala Lys His Tyr Phe 
305 310 315 



<210> 280 
<211> 326 
<212> PRT 

<213> Thermoplasma volcanium 



<400> 280 



Met 


Asn 


Leu 


Tyr 


Arg 


Gly 


Met 


Glu Phe Asn Leu Arg Ser Val Ser Thr 


1 








5 






10 15 


Glu 


Ala 


Lys 


Glu 


Arg 


Asp 


Phe 


Asp Val He He He Gly Ala Gly Ala 








20 








25 30 


Ala 


Gly 


Phe 


Ser 


Ala 


Ala 


Val 


Tyr Ala Ser Arg Ser Gly Leu Ser Ala 






35 










40 45 


Val 


He 


Leu 


Asp 


Lys 


Asn 


Val 


Ala Gly Gly Leu Thr Ala Glu Ala Pro 




50 










55 


60 


Leu 


Val 


Glu 


Asn 


Tyr 


Leu 


Gly 


Phe Lys Ser He Val Gly Ser Asp Leu 


65 










70 




75 80 


Ala 


Lys 


Asn 


Phe 


Ala 


Glu 


His 


Ala Ser Glu Tyr Ala Ser He Arg Glu 










85 






90 95 


Gly 


Val 


Glu 


Val 


Lys 


Ser 


Val 


Lys Lys Gly Asp Gly Gly Phe He Val 








100 








105 * 110 


Asp 


Thr 


Ser 


Asp 


Gly 


Glu 


Tyr 


His Ser Lys Tyr He He He Thr Thr 






115 










120 125 


Gly 


Thr 


Thr 


His 


Lys 


His 


Leu 


Gly Val Lys Gly Glu Ala Glu Tyr Phe 




130 










135 


140 
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Gly Lys Gly Val Ser Tyr Cys Ser Thr Cys Asp Gly Tyr Leu Phe Lys 
145 150 155 160 

Asn Lys Asn Val Val Thr lie Gly Gly Gly Asn Ser Gly Ala He Ala 

165- 170 175 

Ala He Ser Met Ser Glu Tyr Val Lys Asn Ala Thr He Val Glu Tyr 

180 k 185 190 

Met Pro Arg Tyr Met Cys Glu Asn Ala Tyr He Glu Glu He Lys Lys 

195 200 205 

Arg Lys He Pro Tyr He Met Asn Ala Gin Val Thr Glu He Val Gly 

210 215 220 

Asp Gly Lys Lys Val Thr Gly Val Lys Tyr Lys Asp Arg Ser Ser Gly 
225 230 235 240 

Glu Glu Lys Thr Leu Pro Ala Asp Gly Val Phe Val Tyr Val Gly Leu 

245 250 255 

lie Pro Gin Thr Ser Phe Leu Lys Asp Ser Gly Val Lys Leu Asp Glu 

260 . 265 270 

Arg Gly Tyr He He Val Asp Gly Arg Gin Arg Thr Asn Val Pro Gly 

275 280 285 

He Tyr Ala Ala Gly Asp Val Thr Ser Gly Ser Phe Ala Gin He Ala 

290 295 300 

Ser Ala Val Gly Asp Gly Cys Lys Ala Ala Leu Ser Leu Tyr Ser Asp 
305 310 315 320 

Thr He Ser Ser Lys Lys 
325 



<210> 281 
<211> 309 
<212> PRT 

<213> Ureaplasma parvum 
<400> 281 

Met Asn Gin Glu Val Tyr Asp Leu Val He He Gly Ala Gly Pro Ala 

15 10 15 

Gly Leu Ala Ala Ala Val Tyr Ala Lys Arg Ser Gly Leu Asn Val He 

20 25 30 

He Val Glu Lys Gin Phe Pro Gly Gly Lys He Ala Leu Thr Ser Asn 

35 40 45 

Val Glu Asn Tyr Leu Gly He Asn Ser He Pro Gly Pro Glu Leu Ala 

50 55 60 

Tyr Lys Met Tyr Glu Gin Val Leu Asn Leu Asn Val Ser He He Tyr 
65 70 75 80 

Glu Ala Ala Asp Glu He Ser Leu Lys Glu Lys Tyr Lys Lys He Lys 

85 90 ~* 95 

Leu Thr Thr Gin Thr Leu He Thr Lys Thr Val He He Ala Thr Gly 

100 105 110 

Thr Glu Asn Arg Arg Leu Asn He Leu Gly Glu Leu Glu Phe Glu Asn 

115 120 125 

Lys Gly He Ser Tyr Cys Ala He Cys Asp Gly Pro Leu Tyr Lys Asn 

130 135 ^ 140 

Lys Ala Val Ser Val He Gly Ser Gly Asn Ser Ala Val Glu Glu Ala 
145 150 155 160 

He Tyr Leu Ala Thr He Ala Lys Glu Val His Leu He Ala Asn Lys 

165 170 175 

Pro Gin Phe Lys Ala Glu Gin Gin Leu Val Gin He Ala Asn Asn Thr 

180 185 190' 

Pro Asn He Lys He Tyr Tyr Asn Lys Gin Thr Phe Glu Phe Phe Gly 

195 200 205 

His Gin Phe Leu Glu Gly Leu Lys Phe Arg Asp Leu He Thr Asn Glu 

210 215 220 

Val Thr Thr Leu Asn He Glu Ala Asn Phe Thr Phe He Gly Leu Leu 
225 230 235 * 240 

Pro Ser Arg He Asn Thr Asn Asn Leu Cys He Phe Asn Glu Val Asn 

245 250 255 

Gly Phe He Thr Thr Asp Lys Asn Met Gin Thr Ser Val Cys Gly He 

260 265 270 

Phe Ala Ala Gly Asp He Val Asp Lys Asn Val Arg Glh He Ala Thr 
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275 280 285 

Ala Thr Asn Asp Gly Val lie Ala Ala Leu Tyr Ala Lys Glu Tyr He 

290 295 300 

Thr Arg Asn Asn Trp 
305 



<210> 282 
. <211> 318 
<212> PRT 

<213> Vibrio cholerae 
<400> 282 

Met Ser Asn Val Lys His Ser Lys Leu Leu He Leu Gly Ser Gly Pro 

15 10 15 

Ala Gly Tyr Thr Ala Ala Val Tyr Ala Ala Arg Ala Asn Leu Lys Pro 

20 25 30 

Val Leu Val Thr Gly Met Gin Gin Gly Gly Gin Leu Thr Thr Thr Thr 

35 40 45 

Glu Val Glu Asn Trp Pro Gly Asp Ala Glu Gly Leu Thr Gly Pro Ala 

50 55 60 

Leu Met Glu Arg Met Lys Glu His Ala Glu Arg Phe Asp Thr Glu He 
65 70 75 80 

Val Phe Asp His He Asn Ser Val Asp Leu Ser Ser Arg Pro Phe. Arg 

85 90 95 

Leu Thr Gly Asp Ser Gin Glu Tyr Thr Cys Asp Ala Leu He He Ser 

100 105 110 

Thr Gly Ala Ser Ala Lys Tyr Leu Gly Leu Glu Ser Glu Glu Ala Phe 

115 120 125 

Lys Gly Arg Gly Val Ser Ala Cys Ala Thr Cys Asp Gly Phe Phe Tyr 

130 135 140 

Arg Asn Gin Lys Val Ala Val Val Gly Gly Gly Asn Thr Ala Val Glu 
145 150 155 160 

Glu Ala Leu Tyr Leu Ser Asn He Ala Ser Glu Val His Leu Val His 

165 170 175 

Arg Arg Asp Ser Phe Arg Ser Glu Lys He Leu He Asp Arg Leu Met 

180 185 190 

Asp Lys Val Ala Asn Gly Asn He Val Leu His Thr His Arg Thr Leu 

195 " 200 205 

Asp Glu Val Leu Gly Asp Glu Met Gly Val Thr Gly Val Arg Leu Lys 

210 215 220 

Asp Thr Gin Ser Asp Met Thr Glu Asn Leu Asp Val Met Gly Val Phe 
225 230 235 *" 240 

He Ala He Gly His Gin Pro Asn Ser Gin He Phe Glu Gly Gin Leu 

245 250 255 

Glu Met Lys Asn Gly Tyr He Val Val Lys Ser Gly Leu Glu Gly Asn 

260 ~ 265 * ^ 270 

Ala Thr Gin Thr Ser He Glu Gly Val Phe Ala Ala Gly Asp Val Met 

275 280 285 

Asp His Asn Tyr Arg Gin Ala He Thr Ser Ala Gly Thr Gly Cys Met 

290 ~ 295 300 

Ala Ala Leu Asp Ala Glu Arg Tyr Leu Asp Ser Gin Gly Lys 
305 310 ~ 315 



<210> 283 
<211> 321 
<212> PRT 

<213> Xylella fastidiosa 
<400> 283 

Met Ser Asp Tyr Pro Ala Ser Ala 

1 5 
Gly Ser Gly Pro Ala Gly Trp Thr 
20 

Asn Leu Gin Pro Val Leu He Thr 
35 40 



Lys His Ser Arg Leu Leu He Leu 

10 15 
Ala Ala Val Tyr Ala Ala Arg Ala 
25 30 
Gly Leu Gin Gin Gly Gly Gin Leu 
45 
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Met 


Thr 


Thr Thr 


Glu Val 


Asp 


Asn 




50 






55 




Met 


Gly 


Pro Asp 


Leu Met 


Glu 


Arg 


65 






70 






Asp 


Thr 


Lys Val 


He Phe 


Asp 


Gin 








85 






Arg 


Pro 


Phe Thr 


Leu Phe 


Gly 


Asp 






100 








Leu 


He 


He Ala 


Thr Gly 


Ala 


Asn 






115 




120 


Glu 


Glu 


Ala Phe 


Lys Gly 


Arg 


Gly 




130 






135 




Gly 


Phe 


Phe Tyr 


Arg Asp 


Gin 


Asp 


145 






150 






Thr 


Ala 


Val Glu 


Glu Ala 


Leu 


Tyr 








165 






Tyr 


Leu 


He His 


Arg Arg 


Asp 


Lys 






180 








Asn 


Lys 


Leu Phe 


Ser Lys 


Ala 


Ala 




195 






200 


Asn 


Asn 


Ala Val 


Glu Glu 


Val 


Leu 




210 






215 




Val 


Arg 


He Arg 


Ser Thr 


Gin 


Asp 


225 






230 






Gin 


Gly 


Leu Phe 


Val Ala 


He 


Gly 






245 






Ala 


Gly 


Gin Leu 


Ala Met 


Asn 


Asn 




260 








Thr 


Ala 


Gly Asn 


Val Thr 


Gin 


Thr 






275 






280 


Gly 


Asp 


Val Ala 


Asp Gin 


His 


Tyr 




290 






295 




Phe 


Gly 


Cys Met 


Ala Ala 


Leu 


Asp 


305 






310 







Asn 



<210> 284 

<211> 318 

<212> PRT 

<213> Zymomonas mobilis 



<400> 284 



Met 


Ser 


Ala 


Asp 


Pro 


He 


Ser 


Thr 


1 






5 








Pro 


Ala 


Gly 


Leu 


Thr 


Ala 


Ala 


He 








20 










Pro 


He 


Val 


Ala 


Gin 


Gly 


Leu 


Gin 






35 










40 


Thr 


Glu 


Val 


Glu 


Asn 


Phe 


Pro 


Gly 




50 










55 




Trp 


Leu 


Met 


Glu 


Glu 


Met 


Gin 


Ala 


65 










70 






Leu 


Val 


Trp 


Asp 


He 


He 


Thr 


Ser 










85 








Arg 


Leu 


Met 


Gly 


Asp 


Gly 


Gly 


Gin 








100 










He 


Ser 


Thr 


Gly 


Ala 


Gin 


Ala 


Arg 






115 










120 


Ala 


Leu 


Arg 


Gly 


Lys 


Gly 


He 


Ser 




130 










135 




Phe 


Phe 


Arg 


Gly 


Lys 


Lys 


Val 


Val 


145 










150 






Val 


Glu 


Glu 


Ala 


Leu 


Tyr 


Leu 


Thr 










165 








He 


His 


Arg 


Arg 


Asp 


Ser 


Leu 


Arg 



Trp 


Pro 


Gly 


Asp 


Ala 


His 


Gly Leu 






60 








Met 


Gin 


Ala 


His 


Ala 


Glu 


Arg Phe 






75 








80 


He 


Tyr 


Lys 


Ala 


Asp 


Leu 


Ser Thr 




90 








95 


Ser 


Gly 


Leu 


Tyr 


Thr 


Cys 


Asp Gly 


105 










110 




Ala 


Lys 


Tyr 


Leu 


Gly 


He 


Pro Ser 










125 






Val 


Ser 


Ala 


Cys 


Ala 


Thr 


Cys Asp 








140 








Val 


Ala 


Val 


He 


Gly 


Gly 


Gly Asn 






155 








16a 


Leu 


Ser 


Asn 


He 


Ala 


Arg 


Lys Val 




170 










175 


Leu 


Arg 


Ala 


Glu 


Lys 


He 


Met Gin 


185 






190 




Thr 


Gly 


Lys 


He 


Glu 


Leu 


He Trp 










205 






Gly 


Asn 


Asp 


Ala 


Ser 


Val 


Thr 5 Gly 








220 








Ser 


Ser 


Thr 


Arg 


Asp 


He 


Asp Val 






235 








- 240 


His 


His 


Pro 


Asn 


Thr 


Asp 


Leu Phe 




250 










255 


Gly 


Tyr 


Leu 


Gin 


He 


His 


Ser Gly 


265 










270 




Ser 


Val 


Glu 


Gly 


Val 


Phe 


Ala Ala 








285 






Arg 


Gin 


Ala 


He 


Thr 


Ser 


Ala Gly 








300 








Ala 


Glu 


Arg 


Phe 


Leu 


Asp 


Lys Gly 



315 320 



Arg 


Val 


Phe 


He 


Leu 


Gly 


Ser Gly 




10 










15 




Tyr 


Ala 


Ala 


Arg 


Ala 


Gly 


Leu 


Asn 


25 










30 






Pro 


Gly 


Gly 


Gin 


Leu 
45 


Thr 


lie 


Thr 


Phe 


Arg 


Glu 


Pro 


He 


Gin 


Gly Pro 






60 










Gin 


Ala 


Glu 


Asn 


Val 


Gly 


Ala 


Lys 






75 








80 


Val 


Asp 
90 


Phe 


Ser 


Gin 


Arg 


Pro 
95 


Tyr 


Val 


Tyr 


Leu 


Ala 


Asp 


Ser 


Leu 


He 


105 










110 






Trp 


Leu 


Gly 


Leu 


Glu 
125 


Ser 


Glu 


Thr 


Ala 


Cys 


Ala 


Thr 


Cys 


Asp 


Gly Phe 








140 










Val 


He 


Gly 
155 


Gly 


Gly 


Asn 


Thr 


Ala 
160 


Asn 


His 
170 


Ser 


Pro 


Glu 


Val 


Thr 
175 


Leu 


Ala 


Glu 


Lys 


He 


Met 


Gin 


Lys 


Arg 
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180 



Leu 


Leu 


Ala Asn Pro Lys 


He Lys 






195 


200 


Glu 


Phe 


He Ala Gly Glu 


Asp Ser 




210 


215 


Asp 


Thr 


Lys Thr Gly Glu 


Glu Ser 


225 




230 




He 


Ala 


He Gly His Lys 


Pro Ala 






245 




Lys 


Leu 


Asp Asp Glu Gly 


Tyr -He 






260 




Thr 


Ser 


He Lys Gly He 


Phe Ala 






275 


280 


Tyr 


Arg 


Gin Ala Val Thr 


Ala Ala 




290 




295 


Glu 


Ala 


Glu Arg Phe Leu 


Gly Glu 


305 




310 





185 190 
He Arg Trp Asn Ser Glu Val Ala 
205 

Ala Leu Ser Ala Val Lys Leu Lys 
220 

Leu Leu Glu Thr Glu Gly Ala Phe 
235 240 
Thr Glu Leu Phe Gin Gly His Leu 

250 ~ 255 

Glu Val Thr Pro Gly Thr Thr Gin 
265 270 
Cys Gly Asp Val Met Asp Lys His 
285 

Gly Thr Gly Cys Met Ala Ala Leu 
300 

He Asp Phe Lys Glu Asp 
315 



<210> 285 
<211> 122 
<212> PRT 
<213> Bos taurus 

<400> 285 



Lys 


Leu 


Met 


His 


Gin Ala 


Ala Leu Leu 


Gly Gin Ala Leu Thr Asp Ser 


1 








5 




10 15 


Arg 


Lys 


Phe 


Gly 


Trp Glu 


Tyr Ser Gin 


Gin Val Arg His Ser Trp Ala 








20 




25 


30 


Thr 


Met 


Thr 


Glu 


Ala He 


Gin Ser His 


He Gly Ser Leu Ser Trp Gly 






35 






40 


45 


His 


Arg 


Leu 


Ala 


Leu Arg 


Glu Lys Ala 


Val Thr Tyr Val Asn Ser Phe 




50 








55 


60 


Gly 


Glu 


Phe 


Val 


Glu His 


His Lys Val 


Lys Ala Thr Asn Glu Lys Gly 


65 








70 




75 * 80 


Gin 


Glu 


Val 


Leu 


Tyr Thr 


Ala Ala Lys 


Phe Val He Ala Thr Gly Glu 










85 




90 95 


Arg 


Pro 


Arg 


Tyr 


Leu Gly 


He Pro Gly 


Asp Arg Glu Tyr Cys He Thr 








100 




105 


110 


Ser 


Asp 


Asp 


Leu 


Phe Ser 


Leu Pro Tyr 


Cys 






115 






120 





<210> 286 

<211> 511 

<212> PRT 

<213> Bos taurus 



<400> 286 



Met 


Ala 


Ala 


Leu 


Arg Gly 


Ala Ala Ala 


Arg Phe Arg Gly Arg Ala Pro 


1 








5 




10 15 


Gly 


Gly 


Ala 


Arg 


Gly Ala 


Ala Gly Arg 


Gin Cys Tyr Asp Leu Leu Val 








20 




25 


30 


He 


Gly 


Gly 


Gly 


Ser Gly 


Gly Leu Ala 


Cys Ala Lys Glu Ala Ala Gin 






35 






40 


45 


Leu 


Gly 


Lys 


Lys 


Val Ala 


Val Leu Asp 


Tyr Val Glu Pro Ser' Pro Gin 




50 






55 


60 


Gly 


Thr 


Arg 


Trp 


Gly Leu 


Gly Gly Thr 


Cys Val. Asn Val Gly Cys He 


65 








70 




75 80 


Pro 


Lys 


Lys 


Leu 


Met His 


Gin Ala Ala 


Leu Leu Gly Gly Met He Arg 










85 




90 95 


Asp 


Ala 


Pro 


His 


Tyr Gly 


Trp Gly Val 


Ala Gin Ala Pro His Ser Trp 








100 




105 


110 


Ala 


Thr 


Leu 


Ala 


Asp Ala 


Val Gin Asn 


His Val Lys Ser Leu Asn Trp 






115 






120 


125 


Gly 


His 


Arg 


He 


Gin Leu 


Gin Asp Arg 


Lys Val Lys Tyr Phe Asn Val 



130 135 140 
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Lys Ala Ser Phe Val Asp Thr His Thr Val Cys Gly Val Ser Lys Gly 
145 150 155 160 

Gly Glu Glu Thr Leu Leu Ser Ala Glu His lie Val lie Ala Thr Gly 

165 170 " 175 

Gly Arg Pro Arg Tyr Pro Thr His lie Glu Gly Ala Leu Glu Tyr Gly 

180 185 ~ 190 

He Thr Ser Asp Asp Leu Phe Trp Leu Lys Glu Ser Pro Gly Lys Thr 

195 " 200 205 

Leu Val Val Gly Ala Ser Tyr Val Ala Leu Glu Cys Ala Gly Leu Leu 

210 215 220 

Thr Gly Leu Gly Leu Asp Thr Thr Val Met He Arg Ser Val Pro Leu 
225 230 235 240 

Arg Ala Phe Asp Gin Gin Met Ala Ser Leu Val Thr Glu His Met Ala 

245 250 255 

Gly His Gly Thr Arg lie Leu Arg Gly Cys Ala Pro Glu Lys Val Glu 

260 265 270 

Lys Leu Pro Gly Gin Gin Leu Arg Val Thr Trp Val Asp Leu Thr Ser 

275 J 280 285 

Asp Arg Lys Asp Ala Gly Thr Phe Asp Thr Val Leu Trp Ala He Gly 

290 295 300 

Arg Val Pro Glu Thr Ala Ser Leu Asn Leu Glu Lys Ala Gly Val His 
305 310 315 * ^ 320 

Thr Asn Pro Val Thr Gly Lys He Leu Val Asp Ala Gin Glu Thr Thr 

325 330 335 

Ser Val Pro His He Tyr Ala He Gly Asp Val Ala Glu Gly Arg Pro 

340 345 350 

Glu Leu Thr Pro Thr Ala He Met Ala Gly Arg Leu Leu Ala Gin Arg 

355 360 " "* 365 

Leu Ser Gly Arg Thr Ser Asp Leu Met Asp Tyr Ser Ser Val Pro Thr 

370 375 380 

Thr Val Phe Thr Pro Leu Glu Tyr Gly Cys Val Gly Leu Ser Glu Glu 
385 390 395 400 

Ala Ala Val Ala Arg His Gly Glu Glu His Val Glu Val Tyr His Ala 

405 410 415 

Phe Tyr Lys Pro Leu Glu Phe Thr Val Pro Gin Arg Asp Ala Ser Gin 

420 425 430 

Cys Tyr He Lys Met Val Cys Leu Arg Glu Pro Pro Gin Leu Val Leu 

435 rf 440 445 

Gly Leu His Phe Leu Gly Pro Asn Ala Gly Glu Val He Gin Gly Phe 

450 455 460 

Ala Leu Gly He Lys Cys Gly Ala Ser Tyr Gin Gin Leu Met Arg Thr 
465 470 475 480 

Val Gly He His Pro Thr Cys Ala Glu Glu Val Ala Lys Leu Arg He 

485 490 495 

Ser Lys Arg Ser Gly Leu Asp Pro Thr Val Thr Gly Cys Cys Gly 
500 505 510 



<210> 287 
<211> 525 
<212> PRT 

<213> Caenorhabditis elegans 
<220> 

<221> VARIANT 
<222> 524 

<223> Xaa = Any Amino Acid 
<400> 287 

Met Tyr He Lys Gly Asn Ala Val 

1 5 
Leu Lys Gin Asp Tyr Leu Lys Glu 
20 

Leu He Val He Gly Gly Gly Ser 

35 40 
Ala Ser Arg Leu Gly Lys Lys Val 
50 55 



Gly Gly Leu Lys Glu Leu Lys Ala 

10 15 
Trp Leu Arg Asp His Thr Tyr Asp 
25 30 
Gly Gly Leu Ala Ala Ala Lys Glu 
45 

Ala Cys Leu Asp Phe Val Lys Pro 
60 
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Ser Pro Gin Gly Thr Ser Trp Gly Leu Gly Gly Thr Cys Val Asn Val 
65 70 75 80 

Gly Cys He Pro Lys Lys Leu Met His Gin Ala Ser Leu Leu Gly His 

85 90 95 

Ser He His Asp Ala Lys Lys Tyr Gly Trp Lys Leu Pro Glu Gly Lys 

100 105 110 

Val Glu His Gin Trp Asn His Leu Arg Asp Ser Val Gin Asp His He 

115 120 125 

Ala Ser Leu Asn Trp Gly Tyr Arg Val Gin Leu Arg Glu Lys Thr Val 

130 135 140 

Thr Tyr He Asn Ser Tyr Gly Glu Phe Thr Gly Pro Phe Glu He Ser 
145 150 155 160 

Ala Thr Asn Lys Lys Lys Lys Val Glu Lys Leu Thr Ala Asp Arg Phe 

165 170 175 

Leu He Ser Thr Gly Leu Arg Pro Lys Tyr Pro Glu He Pro Gly Val 

180 185 190 

Lys Glu Tyr Thr He Thr Ser Asp Asp Leu Phe Gin Leu Pro Tyr Ser 

195 200 205 

Pro Gly Lys Thr Leu Cys Val Gly Ala Ser Tyr Val Ser Leu Glu Cys 

210 215 220 

Ala Gly Phe Leu His Gly Phe Gly Phe Asp Val Thr Val Met Val Arg 
225 230 235 240 

Ser He Leu Leu Arg Gly Phe Asp Gin Asp Met Ala Glu Arg He Arg 

245 250 255 

Lys His Met He Ala Tyr Gly Met Lys Phe Glu Ala Gly Val Pro Thr 

260 265 270 

Arg He Glu Gin He Asp Glu Lys Thr Asp Glu Lys Ala Gly Lys Tyr 

275 280 285 

Arg Val Phe Trp Pro Lys Lys Asn Glu Glu Thr Gly Glu Met Gin Glu 

290 295 300 

Val Ser Glu Glu Tyr Asn Thr He Leu Met Ala He Gly Arg Glu Ala 
305 310 315 " 320 

Val Thr Asp Asp Val Gly Leu Thr Thr He Gly Val Glu Arg Ala Lys 

325 330 335 

Ser Lys Lys Val Leu Gly Arg Arg Glu Gin Ser Thr Thr He Pro Trp 

340 345 350 

Val Tyr Ala He Gly Asp Val Leu Glu Gly Thr Pro Glu Leu Thr Pro 

355 360 365 

Val Ala He Gin Ala Gly Arg Val Leu Met Arg Arg He Phe Asp Gly 

370 375 380 

Ala Asn Glu Leu Thr Glu Tyr Asp Gin He Pro Thr Thr Val Phe Thr 
385. 390 395 400 

Pro Leu Glu Tyr Gly Cys Cys Gly Leu Ser Glu Glu Asp Ala Met Met 

405 410 415 

Lys Tyr Gly Lys Asp Asn He He He Tyr His Asn Val Phe Asn Pro 

420 425 430 

Leu Glu Tyr Thr He Ser Glu Arg Met Asp Lys Asp His Cys Tyr Leu 

435 440 ~ 445 

Lys Met He Cys Leu Arg Asn Glu Glu Glu Lys Val Val Gly Phe His 

450 455 ~ 460 

He Leu Thr Pro Asn Ala Gly Glu Val Thr Gin Gly Phe Gly He Ala 
465 470 475 480 

Leu Lys Leu Ala Ala Lys Lys Ala Asp Phe Asp Arg Leu He Gly He 

485 490 495 

His Pro Thr Val Ala Glu Asn Phe Thr Thr Leu Thr Leu Glu Lys Lys 

500 505 510' 

Glu Gly Asp Glu Glu Leu Gin Ala Ser Gly Cys Xaa Gly 
515 520 525 



<210> 288 
<211> 667 
<212> PRT 

<213> Caenorhabditis elegans 
<220> 

<221> VARIANT 
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<222> 666 

<223> Xaa = Any Amino Acid 



<400> 288 



Met 


Lys 
2 *~ 


Ser 


Leu 


Thr 


Glu 


Leu 


Phe 


Gly Cys 


Phe 


Lys Arg Gin Pro Arg 


1 






5 








10 




15 


Gin 


Gin 


Glu 


Ala 


Ser 


Ser 


Pro 


Ala 


Asn Pro 


His 


Val Ser Asp Thr Leu 








20 










25 




30 


Ser 


Met 


Gly 


Val 


Ala 


Ala 


Ser 


Gly 


Met Pro 


Pro 


Pro Lys Arg Pro Ala 






35 










40 






45 


Pro 


Ala 


•Glu 


Ser 


Pro 


Thr 


Leu 


Pro 


Gly Glu 


Thr 


Leu Val Asp Ala Pro 




50 










55 








60 


Gly 


He 


Pro 


Leu 


Lys 


Glu 


Ala 


Leu 


Lys Glu 


Ala 


Ala Asn Ser Lys He 


65 








70 








75 


80 


Val 


lie 


Phe 


Tyr 


Asn 


Ser 


Ser 


Asp 


Glu Glu 


Lys 


Gin Leu Val Glu Phe 








85 








90 




95 


Glu 


Thr 


Tyr 


Leu 


Asn 


Ser 


Leu 


Lys 


Glu Pro 


Ala 


Asp Ala Glu Lys Pro 






100 










105 




110 


Leu 


Glu 


He 


Pro 


Glu 


He 


Lys 


Lys 


Leu Gin 


Val 


Ser Arg Ala Ser Gin 






115 










120 






125 


Lvs 


Val 


He 


Gin 


Tyr 


Leu 


Thr 


Leu 


His Thr 


Ser 


Trp Pro Leu Met Tyr 




130 










135 








140 


He 


Lvs 


Gly 


Asn 


Ala 


Val 


Gly 


Gly 


Leu Lys 


Glu 


Leu Lys Ala Leu Lys 


145 










150 








155 


- 160 


Gin 


ASP 


Tvr 


Leu 


Lvs 


Glu 


Trp 


fceu 


Arg Asp 


His 


Thr Tyr Asp Leu He 










165 








170 




175 


Val 


He 


Glv 


Glv 


Glv 


Ser 


Glv 


Gly 

J: 


Leu Ala 


Ala 


Ala Lys Glu Ala Ser 








180 










185 




190 


Ara 


Leu 


Glv 


Lys 


Lys 


Val 


Ala 


Cys 


Leu Asp 


Phe 


Val Lys Pro Ser Pro 






195 










200 






205 


Gin 


Glv 


Thr 


Ser 


Trp 


Glv 


Leu 


Gly 


Gly Thr 


Cys 

2 w 


Val Asn Val Gly Cys 




210 










215 








220 


He 


Pro 


Lvs 


Lvs 


Leu 


Met 


His 


Gin 


Ala Ser 


Leu 


Leu Gly His Ser He 


225 






230 








235 


240 


His 


ASD 

****** 


Ala 


Lvs 


Lvs 


Tvr 

Jr 


Gly 


Trp 


Lys Leu 


Pro 


Glu Gly Lys Val Glu 










245 








250 




255 


His 


Gin 




Asn 


His 


Leu 


Arq 


Asp 


Ser Val 


Gin 


Asp His He Ala Ser 








260 










265 




270 


Leu 


Asn 


Trp 


Glv 


Tvr 


Ara 


Val 


Gin 


Leu Arg 


Glu 


Lys Thr Val Thr Tyr 






275 










280 






285 


He 


Asn 


Ser 


Tvr 


Gly 


Glu 


Phe 


Thr 


Gly Pro 


Phe 


Glu He Ser Ala Thr 




290 






295 






300 


Asn 


Lvs 


Lvs 


Lvs 

"Jr 0 


Lys 


Val 


Glu 


Lvs 


Leu Thr 


Ala 


Asp Arg Phe Leu He 


305 










310 








315 


320 


Ser 


Thr 


Gly 


Leu 


Atq 


Pro 


Lvs 


Tvr 


Pro Glu 


He 


Pro Gly Val Lys Glu 










325 








330 




335 


Tyr 


Thr 


He 


Thr 


Ser 


Asp 

XT 


Asp 


Leu 


Phe Gin 


Leu 


Pro Tyr Ser Pro Gly 








340 










345 




350 


Lys 


Thr 


Leu 


Cys 


Val 


Gly 


Ala 


Ser 


Tyr Val 


Ser 


Leu Glu Cys Ala Gly 






355 










360 






365 


Phe 


Leu 


His 


Gly 


Phe 


Gly 


Phe 


Asp 


Val Thr 


Val 


Met Val Arg Ser He 




370 










375 








380 


Leu 


Leu 


Ara 


Glv 


Phe 


Asp 


Gin 


Asp 


Met Ala 


Glu 


Arg He Arg Lys His 


385 










390 








395 


400 


Met 


He 


Ala 


Tvr 

2 


Gly 


Met 


Lvs 


Phe 


Glu Ala 


Gly 

2 


Val Pro Thr Arg He 










405 








410 




' 415 


Glu 


Gin 


He 


Asp 


Glu 


Lys 


Thr 


Asp 


Glu Lys 


Ala 


Gly Lys Tyr Arg Val 








420 










425 




430 


Phe 


Trp 


Pro 


Lys 


Lys 


Asn 


Glu 


Glu 


Thr Gly 


Glu 


Met Gin Glu Val Ser 






435 










440 






445 


Glu 


Glu 


Tyr 


Asn 


Thr 


He 


Leu 


Met 


Ala He 


Gly 


Arg Glu Ala Val Thr 




450 








455 








460 


Asp 


Asp 


Val 


Gly 


Leu 


Thr 


Thr 


He 


Gly Val 


Glu 


Arg Ala Lys Ser Lys 


465 










470 








475 


480 


Lys 


Val 


Leu 


Gly 


Arg 


Arg 


Glu 


Gin 


Ser Thr 


Thr 


He Pro Trp Val Tyr 










485 








490 




495 


Ala 


He 


Gly 


Asp 


Val 


Leu 


Glu 


Gly 


Thr Pro 


Glu 


Leu Thr Pro Val Ala 
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500 








505 


510 


lie 


Gin 


Ala 


Gly 


Arg 


Val 


Leu 


Me t Ara Ara lie Phe 


Asd Glv Ala Asn 

vj_i__y /ixn noil 






515 








520 


525 


Glu 


Leu 


Thr 


Glu 




Asp 


Gin 


lie Prn Thr Thr Val 

J- V»» J. JU \S Alii. A11X V Ul 


Phe Thr Prn Tien 
rue iiii. riu jjcu 




530 






535 


540 




Glu 


Tvr 


m v 

wl y 


L-y o 


y o 


VJl Jr 


Leu 


S<»r Gl n Gl ii Acitj Al a 

uCi V?l LL VJ1 u nou jrllct 


IVTof- Mp>t~ TiVQ Tvr 
vie L- i"ic l. n y lyi 


545 










J JU 




555 


560 


Glv 


a 


Ren 


ash 


Tl ~ 

lie 


Tl 

11C 


ne 


Tvr TT*i st A<sn Val Ph o 
lyi ma <rvoii v cli it lie 


Aon Pr/"> Titan Gin 
rioil riU lie LI ulU 








565 






570 


575 




Thr 

1.111 


lie 


Car 
OCX 


V7l Li 




i T ic l> 


riop JJjf O Xllb Ljo 


Tvr T.011 T.vc Mot" 
1 y 1 lie u. uy 0 I'lc u 








580 








585 


590 


lie 


Cys 






7\ cm 


V3l LI 


VJ1 LI 


Gl ii T.vq Val Val Glv 


ir ne 111 0 lie ucu 






595 










600 


605 


Thr 


Pro 


Asn 


Ala 


Glv 


Glu 


Val 


Thr Gin Glv Phe Glv 

jl iix vjxii vjx y t ii^ vjju y 


Tie Ala Tien Tivci 




610 








615 


620 


Leu 


Ala 


Ala 


Lys 


Lys 


Ala 


Asp 


Phe Asp Arg Leu lie 


Gly lie His Pro 


625 










630 




635 


640 


Thr 


Val 


Ala 


Glu 


Asn 


Phe 


Thr 


Thr Leu Thr Leu Glu 


Lys Lys Glu Gly 










645 






650 


655 


Asp 


Glu 


Glu 


Leu 


Gin 


Ala 


Ser 


Gly Cys Xaa Gly 










660 








665 





<210> 289 
<211> 516 
<212> PRT 

<213> Drosophila raelanogaster 
<400> 289 

Met Ser Thr lie Lys Phe Leu Arg Ser Ser Thr His Asn Ala Leu Arg 

1 5 ~ 10 15 

Ser Ser Leu Gly Trp Cys Arg Leu Ala Ala Ser Arg Pro Arg Tyr Asp 

20 25 30 

Tyr Asp Leu Val Val Leu Gly Gly Gly Ser Ala Gly Leu Ala Cys Ala 

35 " 40 45 

Lys Glu Ala Ala Gly Cys Gly Ala Arg Val Leu Cys Phe Asp Tyr Val 

50 55 60 

Lys Pro Thr Pro Val Gly Thr Lys Trp Gly lie Gly Gly Thr Cys Val 
65 70 75 80 

Asn Val Gly Cys lie Pro Lys Lys Leu Met His Gin Ala Ser Leu Leu 

85 90 95 

Gly Glu Ala Val His Glu Ala Val Ala Tyr Gly Trp Asn Val Asp Asp 

100 105 110 

Thr Asn lie Arg Pro Asp Trp Arg Lys Leu Val Arg Ser Val Gin Asn 

115 " 120 ~ " 125 

His He Lys Ser Val Asn Trp Val Thr Arg Val Asp Leu Arg Asp Lys 

130 135 140 

Lys Val Glu Tyr Val Asn Ser Met Ala Thr Phe Arg Asp Ser His Thr 
145 150 155 160 

He Glu Tyr Val Ala Met Pro Gly Ala Glu His Arg Gin Val Thr Ser 

165 170 175 

Glu Tyr Val Val Val Ala Val Gly Gly Arg Pro Arg Tyr Pro Asp He 

180 185 " ~ 190 

Pro Gly Ala Val Glu Leu Gly He Thr Ser Asp Asp He Phe Ser Tyr 

195 200 205 

Glu Arg Glu Pro Gly Arg Thr Leu Val Val Gly Ala Gly Tyr Val Gly 

210 215 220 

Leu Glu Cys Ala Cys Phe Leu Lys Gly Leu Gly Tyr Glu Pro Thr Val 
225 230 235 240 

Met Val Arg Ser He Val Leu Arg Gly Phe Asp Arg Gin Met Ser Glu 

245 * 250 255 

Leu Leu Ala Ala Met Met Thr Glu Arg Gly He Pro Phe Leu Gly Thr 

260 265 270 

Thr He Pro Lys Ala Val Glu Arg Gin Ala Asp Gly Arg Leu Leu Val 

275 280 285 

Arg Tyr Arg Asn Thr Thr Thr Gin Met Asp Gly Ser Asp Val Phe Asp 
290 295 300 
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Thr Val Leu Trp Ala lie Gly Arg Lys Gly Leu lie Glu Asp Leu Asn 
305 310 ~ " 315 320 

Leu Asp Ala Ala Gly Val Lys Thr His Asp Asp Lys He Val Val Asp 

325 ~ 330 ~ * 335 

Ala Ala Glu Ala Thr Ser Val Pro His He Phe Ala Val Gly Asp He 

340 -345 350 

He Tyr Gly Arg Pro Glu Leu Thr Pro Val Ala He Leu Ser Gly Arg 

355 360 365 

Leu Leu Ala Arg Arg Leu Phe Ala Gly Ser Thr Gin Leu Met Asp Tyr 

370 375 380 

Ala Asp Val Ala Thr Thr Val Phe Thr Pro Leu Glu Tyr Ser Cys Val 
385 390 395 400 

Gly Met Ser Glu Glu Thr Ala He Glu Leu Arg Gly Ala Asp Asn He 

405 410 415 

Glu Val Phe His Gly Tyr Tyr Lys Pro Thr Glu Phe Phe . He Pro Gin 

420 425 430 

Lys Ser Val Arg His Cys Tyr Leu Lys Ala Val Ala Glu Val Ser Gly 

435 440 445 

Asp Gin Lys lie Leu Gly Leu His Tyr He Gly Pro Val Ala Gly Glu 

450 * 455 460 

Val He Gin Gly Phe Ala Ala Ala Leu Lys Thr Gly Leu Thr Val Lys 
465 470 475 480 

Thr Leu Leu Asn Thr Val Gly He His Pro Thr Thr Ala Glu Glu Phe 

485 * 490 495" 

Thr Arg Leu Ser He Thr Lys Arg Ser Gly Arg Asp Pro Thr Pro Ala 

500 " 505 510 

Ser Cys Cys Ser 
515 



<210> 290 

<211> 5-24 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> VARIANT 
<222> 523 

<223> Xaa = Any Amino Acid 
<400> 290 

Met Ala Ala Met Ala Val Ala Leu Arg Gly Leu Gly Gly Arg Phe Arg 

15 10 15 

Trp Arg Thr Gin Ala Val Ala Gly Gly Val Arg Gly Ala Ala Arg Gly 

20 25 30 

Ala Ala Ala Gly Gin Arg Asp Tyr Asp Leu Leu Val Val Gly Gly Gly 

35 40 45 

Ser Gly Gly Leu Ala Cys Ala Lys Glu Ala Ala Gin Leu Gly Arg Lys 

50 55 60 

Val Ser Val Val Asp Tyr Val Glu Pro Ser Pro Gin Gly Thr Arg Trp 
65 70 75 80 

Gly Leu Gly Gly Thr Cys Val Asn Val Gly Cys He Pro Lys Lys Leu 

85 90 95 

Met His Gin Ala Ala Leu Leu Gly Gly Leu lie Gin Asp Ala Pro Asn 

100 105 110 

Tyr Gly Trp Glu Val Ala Gin Pro Val Pro His Asp Trp Arg' Lys Met 

115 120 125 

Ala Glu Ala Val Gin Asn His Val Lys Ser Leu Asn Trp Gly His Arg 

130 135 140 

Val Gin Leu Gin Asp Arg Lys Val Lys Tyr Phe Asn He Lys Ala Ser 
145 150 155 160 

Phe Val Asp Glu His Thr Val Cys Gly Val Ala Lys Gly Gly Lys Glu 

165 ~ 170 175 

He Leu Leu Ser Ala Asp His lie lie lie Ala Thr Gly Gly Arg Pro 

180 185 190 

Arg Tyr Pro Thr His He Glu Gly Ala Leu Glu Tyr Gly He Thr Ser 
195 200 205 
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Asp 


Asp 


He 


Phe 


Trp 


Leu Lys Glu 


Ser Pro Gly Lys Thr Leu Val Val 




210 








215 


220 


Gly 


Ala 


Ser 


Tyr 


Val 


Ala Leu Glu 


Cys Ala Gly Phe Leu Thr Gly He 


225 










230 


235 240 


Gly 


Leu 


Asp 


Thr 


Thr 


He Met Met 


Arg Ser He Pro Leu Arg Gly Phe 










245 




250 255 


Asp 


Gin 


Gin 


Met 


Ser 


Ser Met Val 


He Glu His Met Ala Ser His Gly 








260 






265 270 


Thr 


Arg 


Phe 


Leu 


Ara 


Gly Cys Ala 


Pro Ser Arg Val Arg Arg Leu Pro 






275 






280 


285 


Asp 


Gly 


Gin 


Leu 


Gin 


Val Thr Trp 


Glu Asp Ser Thr Thr Gly Lys Glu 




290 








295 


300 


Asp 


Thr 


Gly 


Thr 


Phe 


Asp Thr Val 


Leu Trp Ala He Gly Arg Val Pro 


305 










310 


315 ^ 320 


Asp 


Thr 


Arg 


Ser 


Leu 


Asn Leu GJu 


Lys Ala Gly Val Asp Thr Ser Pro 










325 




330 335 


Asp 


Thr 


Gin 


Lys 


He 


Leu Val Asp 


Ser Arg Glu Ala Thr Ser Val Pro 








340 






345 ~ 350 


His 


He 


Tyr 


Ala 


He 


Gly Asp Val 


Val Glu Gly Arg Pro Glu Leu Thr 






355 






360 


365 


Pro 


He 


Ala 


He 


Met 


Ala Gly Arg 


Leu Leu Val Gin Arg Leu Phe Gly 




370 








375 


380 


Gly 


Ser 


Ser 


Asp 


Leu 


Met Asp Tyr 


Asp Asn Val Pro Thr Thr Val Phe 


385 










390 


395 • 400 


Thr 


Pro 


Leu 


Glu 


Tvr 


Gly Cys Val 


Gly Leu Ser Glu Glu Glu Ala Val 










405 




410 415 


Ala 


Arg 


His 


Gly 


Gin 


Glu His Val 


Glu Val Tyr His Ala His Tyr Lys 








420 






425 ~ 430 


Pro 


Leu 


Glu 


Phe 


Thr 


Val Ala Gly 


Arg Asp Ala Ser Gin Cys Tyr Val 






435 






440 


445 


Lvs 


Met 


Val 


Cvs 


Leu 


Arg Glu Pro 


Pro Gin Leu Val Leu Gly Leu His 




450 








455 


460 


Phe 


Leu 


Gly 


Pro 


Asn 


Ala Gly Glu 


Val Thr Gin Gly Phe Ala Leu Gly 


465 








470 


475 480 


He 


Lys 


Cys 


Gly 


Ala 


Ser Tyr Ala 


Gin Val Met Arg Thr Val Gly He 










485 




490 495 


His 


Pro 


Thr 


Cys 


Ser 


Glu Glu Val 


Val Lys Leu Arg He Ser Lys Arg 








500 






505 510 


Ser 


Gly 


Leu 


Asp 


Pro 


Thr Val Thr 


Gly Cys Xaa Gly 






515 






520 





<210> 291 

<211> 497 

<212> PRT 

<213> Homo sapiens 



<400> 291 



Met 


Asn 


Gly 


Pro 


Glu 


Asp Leu Pro 


Lys Ser Tyr Asp Tyr Asp Leu He 


1 








5 




10 15 


He 


He 


Gly 


Gly 


Gly 


Ser Gly Gly 


Leu Ala Ala Ala Lys Glu Ala Ala 








20 






25 30 


Gin 


Tyr 


Gly 


Lys 


Lys 


Val Met Val 


Leu Asp Phe Val Thr Pro Thr Pro 






35 






40 


45 


Leu 


Gly 


Thr 


Arg 


Trp 


Gly Leu Gly 


Gly Thr Cys Val Asn Val Gly Cys 




50 








55 


60 


He 


Pro 


Lys 


Lys 


Leu 


Met His Gin 


Ala Ala Leu Leu Gly Gin Ala Leu 


65 








70 


75 80 


Gin 


Asp 


Ser 


Arg 


Asn 


Tyr Gly Trp 


Lys Val Glu Glu Thr Val Lys His 










85 




90 95 


Asp 


Trp 


Asp 


Arg 


Met 


He Glu Ala 


Val Gin Asn His lie Gly Ser Leu 








100 






105 110 


Asn 


Trp 


Gly 


Tyr 


Arg 


Val Ala Leu 


Arg Glu Lys Lys Val Val Tyr Glu 






115 






120 


125 


Asn 


Ala 


Tyr 


Gly 


Gin 


Phe He Gly 


Pro His Arg He Lys Ala Thr Asn 




130 






135 


140 


Asn 


Lys 


Gly 


Lys 


Glu 


Lys He Tyr 


Ser Ala Glu Arg Phe Leu He Ala 
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145 




1 50 

lJU 


xnx vj_L_y 


(H "111 A yrr 

vsiu. Axy 


JrxO niy 






1 A*5 

lO J 


flTCJ "1*1 A 

i-»y £3 lie 


_ 

a6r S62T 


Asp Afc>p 




180 




x iii_ ucu 


val Veil 


oiy nia 




195 




JJCU A±A 


>jxy lie 


wxy ucu 


210 








oiy xr iic 


len f2l ri 
nop V7X1X 


225 




230 


uXU vjXU. 


alb vjl y 


Tl p T.I^Ci 

lie xiyo 






245 


UXU VjXIl 


Tip ni m 


Ala viiy 




Z D U 




Got" TVl -y 




V71U UlU 




275 




LiGU Ala. 


Tip fll vr 
116 vjly 


niy Abp 


£i Z7 \J 






Vsal ("2*1 v 
Val vsiy 


V al liy o 


Tl q 7V en 
11CS XT. to 11 


1 0 5 

J u J 




Jlw 


UXU V3JLLL 


vjrxn x xix 


noil vdx 






^? 5 


Ul U. ASp 


u y o val 


/xT ii T>aii 




*"i40 




ilcU Ala 


f2T n firrr 
VljXII Hiy 


ucu xyx 




"3 55 

J «J u 




7\on 17a 1 


rxO JLiix 


inr va.1. 








Leu Seir 


bJLU bill 


xiys Ala 


*5 c 






vai lyx 


His Ser 


lyr fixe 








Asp ash 


ASH xjy S 


tys lyx 




f± Z u 




«X LL AX y 


Val Veil 


Uiy IT 11C 




435 




Thr Gin 


Gly Phe 


Ala Ala 


450 






Leu Asp 


Ser Thr 


He Gly 


465 




470 


Thr Leu 


Ser Val 


Thr Lys 






485 


Cys 







155 

Tyr Leu Gly He Pro 
170 

Leu Phe Ser Leu Pro 
185 

Ser Tyr Val Ala Leu 
200 

Asn Val . Thr Val Met 
215 

Asp Met Ala Asn Lys 
235 

Phe He Arg Gin Phe 
250 

Thr Pro Gly Arg Leu 
265 

He He Glu Gly Glu 
280 

Ala Cys Thr Arg Lys 
295 

Glu Lys Thr Gly Lys 
315 

Pro Tyr He Tyr Ala 
330 

Thr Pro Val Ala He 
345 

Ala Gly Ser Thr Val 
360 

Phe Thr Pro Leu Glu 
375 

Val Glu Lys Phe Gly 
395 

Trp Pro Leu Glu Trp 
410 

Ala Lys He He Cys 
425 

His Val Leu Gly Pro 
440 

Ala Leu Lys Cys Gly 
455 

He His Pro Val Cys 
475 

Arg Ser Gly Ala Arg 
490 



160 

Gly Asp Lys Glu Tyx 
175 

Tyr Cys Pro Gly Lys 
190 

Glu Cys Ala Gly Phe 
205 

Val Arg Ser He Leu 
220 

He Gly Glu His Met 
240 

Val Pro He Lys Val 
255 

Arg Val Val Ala Gin 
270 

Tyr Asn Thr Val Met 
285 

He Gly Leu Glu Thr 
300 

He Pro Val Thr Asp 
320 

He Gly Asp He Leu 
335 

Gin Ala Gly Arg' Leu 
350 

Lys Cys Asp Tyr Glu 
365 

Tyr Gly Ala Cys Gly 
380 

Glu Glu Asn He Glu 
400 

Thr He Pro Ser Arg 
415 

Asn Thr Lys Asp Asn 
430 

Asn Ala Gly Glu Val 
445 

Leu Thr Lys Lys Gin 
460 

Ala Glu Val Phe Thr 
480 

He Leu Gin Ala Gly 
495 



<210> 292 

<211> 497 

<212> PRT 

<213> Homo sapien 



<400> 292 



Met Asn 


Gly 


Pro 


Glu Asp 


Leu Pro 


1 






5 




He He 


Gly 


Gly 


Gly Ser 


Gly Gly 






20 






Gin Tyr 


Gly 


Lys 


Lys Val 


Met Val 




35 






40 


Leu Gly 


Thr 


Arg 


Trp Gly 


Leu Gly 


50 








55 


He Pro 


Lys 


Lys 


Leu Met 


His Gin 


65 






70 




Gin Asp 


Ser 


Arg 


Asn Tyr 


Gly Trp 








85 




Asp Trp 


Asp 


Arg 


Met He 


Glu Ala 






100 







Lys Ser Tyr Asp Tyr Asp Leu He 

10 15 
Leu Ala Ala Ala Lys Glu' Pro Ala 
25 30 
Leu Asp Phe Gly Thr Pro Thr Pro 
45 

Gly Thr Cys Val Asn Val Gly Cys 
60 

Ala Ala Leu Leu Gly Gin Ala Leu 

75 80 
Lys Val Glu Glu Thr Val Lys His 

90 95 
Val Gin Asn His He Gly Ser Leu 
105 110 
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Asn Trp 

Asn Ala 
130 
Asn Lys 
145 

Thr Gly 

Cys lie 

Thr Leu 

Leu Ala 
210 
Leu Arg 
225 

Glu Glu 

Glu Gin 

Ser Thr 

Leu Ala 
290 
Val Gly 
305 

Glu Glu 

Glu Asp 

Leu Ala 

Asn Val 
370 
Leu Ser 
385 

Val Tyr 

Asp Asn 

Glu Arg 

Thr Gin 
450 
Leu Asp 
465 

Thr Leu 
Cys 



Gly Tyr 
115 

Tyr Gly 

Gly Lys 

Glu Arg 

Ser Ser 
180 
Val Val 
195 

Gly lie 

Gly Phe 

His Gly 

He Glu 
260 
Asn Ser 
275 

He Gly 

Val Lys 

Gin Thr 

Lys Val 
340 
Gin Arg 
355 

Pro Thr 

Glu Glu 

His Ser 

Asn Lys 
420 
Val Val 
435 

Gly Phe 
Ser Thr 
Ser Val 



Arg Val 

Gin Phe 

Glu Lys 
150 
Pro Arg 
165 

Asp Asp 

Gly Ala 

Gly Leu 

Asp Gin 
230 
He Lys 
245 

Ala Gly 

Glu Glu 

Arg Asp 

He Asn 
310 
Asn Val 
325 

Glu Leu 

Leu Tyr 

Thr Val 

Lys Ala 
390 
Tyr Phe 
405 

Cys Tyr 

Gly Phe 

Ala Ala 

He Gly 
470 
Thr Lys 
485 



Ala Leu 
120 
He Gly 
135 

He Tyr 

Tyr Leu 

Leu Phe 

Ser Tyr 
200 
Asp Val 
215 

Asp Met 

Phe He 

Thr Pro 

He He 
280 

Ala Cys 
295 

Glu Lys 

Pro Tyr 

Thr Pro 

Ala Gly 
360 
Phe Thr 
375 

Val Glu 

Trp Pro 

Ala Lys 

His Val 
440 
Ala Leu 
455 

He His 
Arg Ser 



Arg Glu 

Pro His 

Ser Ala 

Gly He 
170 
Ser Leu 
185 

Val Ala 

Thr Val 

Ala Asn 

Arg Gin 
250 
Gly Arg 
265 

Glu Gly 

Thr Arg 

Thr Gly 

He Tyr 
330 
Val Ala 
345 

Ser Thr 

Pro Leu 

Lys Phe 

Leu Glu 
410 
He He 
425 

Leu Gly 

Lys Cys 

Pro Val 

Gly Ala 
490 



Lys Lys Val 
125 

Arg He Lys 

140 
Glu Arg Phe 
155 

Pro Gly Asp 

Pro Tyr Cys 

Leu Glu Cys 
205 

Met Val Arg 

220 
Lys He Gly 
235 

Phe Val Pro 

Leu Arg Val 

Glu Tyr Asn 
285 

Lys He Gly 

300 
Lys He Pro 
315 

Ala He Gly 

He Gin Ala 

Val Lys Cys 
365 

Glu Tyr Gly 
380 

Gly Glu Glu 
395 

Trp Thr He 

Cys Asn Thr 

Pro Asn Ala 
445 

Gly Leu Thr 

460 
Cys Ala Glu 
475 

Ser He Leu 



Val Tyr Glu 

Ala Thr Asn 

Leu He Ala 
160 

Lys Glu Tyr 

175 
Pro Gly Lys 
190 

Ala Gly Phe 

Ser He Leu 

Glu His Met 
240 

He Lys Val 

255 
Val Ala Gin 
270 

Thr Val Met 

Leu Glu Thr 

Val Thr Asp 
320 

Asp He Leu 

335 
Gly Arg Leu 
350 

Asp Tyr Glu 

Ala Cys Gly 

Asn He Glu 
400 

Pro Ser Arg 

415 
Lys Asp Asn 
430 

Gly Glu Val 

Lys Lys Gin 

Val Phe Thr 
480 

Gin Ala Gly 
495 



<210> 293 
<211> 521 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> VARIANT 
<222> 520 

<223> Xaa = Any Amino Acid 
<400> 293 

Met Ala Val Ala Leu Arg Gly Leu Gly Gly Arg Phe Arg Trp Arg Thr 

15 10 ~ 15 

Gin Ala Val Ala Gly Gly Val Arg Gly Ala Ala Arg Gly Ala Ala Ala 
20 25 30 
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Gly Gin 


Arrr 

Aiy 


A C3T*» 

AJbp 


ryr 


Ren 
Ao p 


XtCLL ucu veil 


Val Glv Glv Glv cjer Glv Glv 

vax vjrxy vjxy vjxy oex \jxjf wijr 




35 








40 


45 


Leu Al a 

xj^vx nxo 


v-y O 


AT a 


xjy o 


Glu 


Ala Ala Gin 
**xa ax cx vjxix 


T.pii Glv Ara* Lvej Val Ala Val 
xjcLL vjxy fliy xjjto vax /ixci vax 


50 










55 


60 


Val Asp 


xyx 


Val 
v ax 


Gl n 

VJX LL 


Pro 


Cpr Pro Gl n 
L-»ex riv vj J 1 1 


Glv Thr Arcr Tm Glv Leu Glv 

vjxy xxxx riiy ax^j vjxjf xicu wxjr 


65 








70 




75 80 


Gly Thr 


Cys 


Val 


Asn 


Val 


Glv Cvs Tie 


Pro Lvs Lvs Leu Met His Gin 








85 






90 95 


Ala Ala 

*X*X. tt «fj In SA 


Leu 


Leu 


Glv 

vjxy 


Glv 

vjxy 


Leu Tie Gin 

J-ieLX XXC VjXXX 


Asn Ala Pro Asn Tvr Glv Tm 






100 






105 


110 


Glu Val 


Ala 


Gin 


Pro 


Val 


Prn Hi a Asn 

* 1U XXX O JTXOp 


Tm Ara Lvs Met Ala Glu Ala 




115 








120 


125 


Val Gin 

V C1X VJXXX 


Asn 


His 

XIX o 


Val 


Lys 


OCX X1CLL riDll 


Tm Glv Hi ci Arrr Val Gin T»f»n 
xxp vjx y xxx o "i-y vax vjxii ucu 


130 










135 


140 


Gl n A cr> 

wXil nop 


Arg 


XJy O 


Val 

V CXX 


xjy o 


TWry Pli0 Son 

x y x xr lie aoxx 


Tl r,vq Ala S£*r Prt^ Val Asn 
xxc xiy o irxxa o^x xrxxc vax nop 


145 








150 




155 160 


Gin Wis 

VJ1U XX J. o 


Thx 


Val 


Cys 


Gly 


Va 1 Al n T iVR 
V CtX AX cl xiy O 


Glv Glv Tivn Gin Tit* T»^ii Tt^n 
vjxy vjxy xjy d vjxll xxc jjcu ucu 








165 






170 175 


Co-*- i\1 a 


Asp 


His 


lie 


lie 


lie Ala Thr 

XXC AXCl xxxx 


Gl v Gl v Arcr Pro Arcr Tvr Pro 






180 






185 




Thr His 


lie 


Glu 


Glv 

vjxy 


Ala 


Leu Glii Tvr 


Glv lie Thr Ser Asr> Asd Tie 




195 








200 


205 


Phe Trp 


Leu 


Lys 


Glu 


Ser 


Pro Glv Lvs 


Thr Leu Val Val Glv Ala Ser 


210 










215 


220 


Tvr Val 

xyx V d-L 


Ala 


Leu 


Glu 


Cys 


Al a Gl v Phe 
x^xcl vjxy xrxxc 


Tif>n Thr Glv I If* Glv Leu A cm 

ucu xxxx vjxy xxc vjx y xjcll nDy 


225 








230 




235 240 


llil. XXI J. 


Tl e 

11C 


i'lCL 


Mpf- 
iuC L, 




Qpw Tip Dt"A 
OCX XXC XrlLv 


Tttaii l\rn Glv Ph e± IVcsn Gin Gin 
XJC LX nxy vjxy xrxxc nop vjxxx vjxxx 








245 






250 255 


ft/To 4~ Cav 


O o >* 
OCX 


Mot- 

rict. 


TT- "I 

Val 


Tl o 
11c 


Gin ITic Mof* 
vjIU xlx 6 Met 


7\ 1 -a Cor Hi c Gl \r Thr Z\r<"f Pno 
Ala ocx £11 d vjiy xxxx Aiy xrxxc 






V w 






265 


270 


T.on A *r*rT 

jjcu ax y 


rjl v- 
vjiy 




Al a 
Ala 


nu 


ocx Aiy Veil 


x\rrr l\rrr T»pii Pro lorv Gl v Gin 
ax y Axy xjcll xr x vj nop vjxy vjxxx 




i ~> 








A O W 


285 


T.on Gl n 

UC LL VJX11 


V Ctl 


XI XX 


irp 


Gl n 


7\ n-n Co-v- Tilt* 
HSp OCX XXXX 


Thr Glv TiVQ Gin Ben Thr Gl \r 
x xxx vjxy nys vjxll nop xxxx vjxy 


290 










295 


300 


Thr Phe 

X 11X XT IXC 


AOp 


Thr 

XXXX 


Val 
v a x 


Leu 


Tm Ala Tl e 

X X LJ Aid lie 


Glv x\rrr Val Pro 1\ on Thr Arrr 
vjxy nxy vax rxu nop xxxx Axy 


305 








310 




315 320 


OCX itcu 


Zi can 
xVip 11 


XlC LI 


Gl ii 

UI LL 


xiy o 


Ala Glv Val 
nia vjxy v ax 


Acir) Thr ^pr Pro Acsn Thr Gin 
Aou xxxx ocx nu nop xxxx vjxxx 








325 




330 335 


TiVct Tl p 
uy a x x c 


T»en 

X-lC. LL 


Val 
val 


A em 


Cpr 
OCX 


Am Gl 1 1 11 a 
niy ui u ax a 


Thr Ser Val Pro TTi q Tie Tvr 
xxxx ocx vax xr xu xxx o xxc xyx 






340 






J Tt -J 


350 


Ala Tie 

AXcl 11C 


Gl v 
vjxy 


7\ en 
Ao p 


Val 

V cLX 


Val 

V ctX 


Gl ii Gl 7\ T - rT 
VJXLL vjiy AX y 


Prn- Glu Tten Thr Pro Tie Ala 

XtXvJ vjxll XJCLL XXXX XT X Ly xxc nxa 




O -J J 










365 


Tip Mpf- 


Al s 


Gl v 
uiy 




iteu 


ucu Val vjlll 


7\ >-rT T.on PVip Gl \r Gl v Cor Qpr 
ax y xjcll it xxc vjxy vjxy ocx ocx 


370 










375 


380 


A err* Tieil 
nop UCU 


Mer 

l iC L> 


Asri 


Tvr 
j.yi 


Asp 


A c^n \7=a 1 Prn 
nan vax rxu 


Thr Thr Val Phe Thr Pro Tien 

xxxx xxxx vax nic xxxx xxvj xjevx 


385 




390 




395 400 


Glu Tyx 


Gly 


Cys 


Val 


Gly 


XJCLL OCX VJX LL 


Glu Glu Ala Val Ala Ara His 

VJXVX VwJ -L. V_A Al 1 1, ^4. V MX Ai ,1. C* L 11X13 








405 






410 415 


Gl v Gl n 


Glu 


His 


Val 


Glu 


Va 1 Tvr TTi <i 
vax xyx xxxo 


Ala Hi 1 ? Tvr Lvs Pro Leu Gin 

nxci *xj. 0 x y x uyo x- jl xj^u vjxu 




420 






425 


430 


Phe Thx* 


Val 


Ala 


Gly 


ax y 


Ac;ri Ala Oar 
nap Axa ocx 


Gl n Pvs Tvr Val Tivs Mer Val 

vjxxx V— y a vix xj y o nc v cix 




435 








440 


445 


Or/C! T ,pi I 
V^jrB UCU 


Arcr 
Aiy 


Glu 


Pro 


Pro 


Gin T,pn Val 

VjXXX XJC LL V a X 


Leu Glv Leu His Phe Leu Glv 

XiW LX V7X V XJ^«-X il il O XilG JJ\« LX VJXjr 


450 










455 


460 


Pro Asn 


Ala 


Gly 


Glu 


Val 


TVir Gl n Gl v 
xxxx vsxxx vjxy 


Phe Al a Leu Glv Tie Lvs Cvs 


465 








470 




475 480 


Gly Ala 


Ser 


Tyr 


Ala 


Gin 


Val Met Arg 


Thr Val Gly lie His Pro Thr 








485 






490 495 


Cys Ser 


Glu 


Glu 


Val 


Val 


Lys Leu Arg 


He Ser Lys Arg Ser Gly Leu 






500 






505 


510 


Asp Pro 


Thr 


Val 


Thr 


Gly 


Cys Xaa Gly 






515 








520 





<210> 294 
<211> 579 
<212> PRT 
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<213> Homo sapiens 
<220> 

<221> VARIANT 
<222> 578 

<223> Xaa = Any Amino Acid 
<400> 294 

Ala Glu Arg Val Val lie Phe Ser Lys Ser Tyr Cys Pro His Ser Thr 

1 5 10 15 

Arg Val Lys Glu Leu Phe Ser Ser Leu Gly Val Glu Cys Asn Val Leu 

20 25 30 

Glu Leu Asp Gin Val Asp Asp Gly Ala Arg Val Gin Glu Val Leu Ser 

35 40 ~ 45 

Glu lie Thr Asn Gin Lys Thr Val Pro Asn He Phe Val Asn Lys Val 

50 55 60 

His Val Gly Gly Cys Asp Gin Thr Phe Gin Ala Tyr Gin Ser Gly Leu 
65 70 75 80 

Leu Gin Lys Leu Leu Gin Glu Asp Leu Ala Tyr Asp Tyr Asp Leu He 

85 " 90 "* 95 

lie He Gly Gly Gly Ser Gly Gly Leu Ser Cys Ala Lys Glu Ala Ala 

100 105 110 

He Leu Gly Lys Lys Val Met Val Leu Asp Phe Val Val Pro Ser Pro 

115 120 " 125 

Gin Gly Thr Ser Trp Gly Leu Gly Gly Thr Cys Val Asn Val Gly Cys 

130 135 140 

He Pro Lys Lys Leu Met His Gin Ala Ala Leu Leu Gly Gin Ala Leu 
145 150 155 "* 160 

Cys Asp Ser Arg Lys Phe Gly Trp Glu Tyr Asn Gin Gin Val Arg His 

165 170 175 

Asn Trp Glu Thr Met Thr Lys Ala He Gin Asn His He Ser Ser Leu 

180 185 190 

Asn Trp Gly Tyr Arg Leu Ser Leu Arg Glu Lys Ala Val Ala Tyr Val 

195 200 205 

Asn Ser Tyr Gly Glu Phe Val Glu His His Lys He Lys Ala Thr Asn 

210 215 220 

Lys Lys Gly Gin Glu Thr Tyr Tyr Thr Ala Ala Gin Phe Val He Ala 
225 230 " 235 240 

Thr Gly Glu Arg Pro Arg Tyr Leu Gly He Gin Gly Asp Lys Glu Tyr 

245 250 255 

Cys He Thr Ser Asp Asp Leu Phe Ser Leu Pro Tyr Cys Pro Gly Lys 

260 * 265 270 

Thr Leu Val Val Gly Ala Ser Tyr Val Ala Leu Glu Cys Ala Gly Phe 

275 280 285 

Leu Ala Gly Phe Gly Leu Asp Val Thr Val Met Val Arg Ser He Leu 

290 295 300 

Leu Arg Gly Phe Asp Gin Glu Met Ala Glu Lys Val Gly Ser Tyr Met 
305 310 315 320 

Glu Gin His Gly Val Lys Phe Leu Arg Lys Phe He Pro Val Met Val 

325 330 335 

Gin Gin Leu Glu Lys Gly Ser Pro Gly Lys Leu Lys Val Leu Ala Lys 

340 345 350 

Ser Thr Glu Gly Thr Glu Thr He Glu Gly Val Tyr Asn Thr Val Leu 

355 360 365 

Leu Ala He Gly Arg Asp Ser Cys Thr Arg Lys He Gly Leu Glu Lys 

370 375 380 

He Gly Val Lys He Asn Glu Lys Ser Gly Lys He Pro Val Asn Asp 
385 390 395 400 

Val Glu Gin Thr Asn Val Pro Tyr Val Tyr Ala Val Gly Asp He Leu 

405 410 415 

Glu Asp Lys Pro Glu Leu Thr Pro Val Ala He Gin Ser Gly Lys Leu 

420 • 425 430 

Leu Ala Gin Arg Leu Phe Gly Ala Ser Leu Glu Lys Cys Asp Tyr He 

435 440 * 445 

Asn Val Pro Thr Thr Val Phe Thr Pro Leu Glu Tyr Gly Cys Cys Gly 

450 455 460 

Leu Ser Glu Glu Lys Ala He Glu Val Tyr Lys Lys Glu Asn Leu Glu 
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465 470 475 480 

lie Tyr His Thr Leu Phe Trp Pro Leu Glu Trp Thr Val Ala Gly Arg 

485 490 495 

Glu Asn Asn Thr Cys Tyr Ala Lys lie lie Cys Asn Lys Phe Asp His 

500 505 510 

Asp Arg Val He Gly Phe His He Leu Gly Pro Asn Ala Gly Glu Val 

515 520 525 

Thr Gin Gly Phe Ala Ala Ala Met Lys Cys Gly Leu Thr Lys Gin Leu 

530 535 540 

Leu Asp Asp Thr He Gly He His Pro Thr Cys Gly Glu Val Phe Thr 
545 550 555 560 

Thr Leu Glu He Thr Lys Ser Ser Gly Leu Asp He Thr Gin Lys Gly 
565 570 575 

Cys Xaa Gly 



<210> 295 
<211> 524 
<212> PRT 
<213> Homo sap i en 

<220> 

<221> VARIANT 
<222> 523 

<223> Xaa = Any Amino Acid 
<400> 295 

Met Ala Ala Met Ala Val Ala Leu Arg Gly Leu Gly Gly Arg Phe Arg 

1 5 10 15 

Trp Arg Thr Gin Ala Val Ala Gly Gly Val Arg Gly Ala Ala Arg Gly 

20 25 30 

Ala Ala Ala Gly Gin Arg Asp Tyr Asp Leu Leu Val Val Gly Gly Gly 

35 40 45 

Ser Gly Gly Leu Ala Cys Ala Lys Glu Ala Ala Gin Leu Gly Arg Lys 

50 55 60 

Val Ala Val Val Asp Tyr Val Glu Pro Ser Pro Gin Gly Thr Arg Trp 
65 70 75 80 

Gly Leu Gly Gly Thr Cys Val Asn Val Gly Cys He Pro Lys Lys Leu 

85 90 95 

Met His Gin Ala Ala Leu Leu Gly Gly Leu He Gin Asp Ala Pro Asn 

100 105 110 

Tyr Gly Trp Glu Val Ala Gin Pro Val Pro His Asp Trp Arg Lys Met 

115 120 125 

Ala Glu Ala Val Gin Asn His Val Lys Ser Leu Asn Trp Gly His Arg 

130 135 140 

Val Gin Leu Gin Asp Arg Lys Val Lys Tyr Phe Asn He Lys Ala Ser 
145 150 155 160 

Phe Val Asp Glu His Thr Val Cys Gly Val Ala Lys Gly Gly Lys Glu 

165 170 175 

He Leu Leu Ser Ala Asp His He He He Ala Thr Gly Gly Arg Pro 

180 185 ~* 190 

Arg Tyr Pro Thr His He Glu Gly Ala Leu Glu Tyr Gly He Thr Ser 

195 200 205 

Asp Asp He Phe Trp Leu Lys Glu Ser Pro Gly Lys Thr Leu Val Val 

210 215 220 

Gly Ala Ser Tyr Val Ala Leu Glu Cys Ala Gly Phe Leu Thr Gly He 
225 230 235 240 

Gly Leu Asp Thr Thr He Met Met Arg Ser He Pro Leu Arg Gly Phe 

245 250 255 

Asp Gin Gin Met Ser Ser Met Val He Glu His Met Ala Ser His Gly 

260 265 270 

Thr Arg Phe Leu Arg Gly Cys Ala Pro Ser Arg Val Arg Arg Leu Pro 

275 280 285 

Asp Gly Gin Leu Gin Val Thr Trp Glu Asp Ser Thr Thr Gly Lys Glu 

290 295 300 

Asp Thr Gly Thr Phe Asp Thr Val Leu Trp Ala He Gly Arg Val Pro 
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305 








310 






Thr 


Arcr 


Ser 


Leu Asn Leu 

*JvU 


Glu 










325 




Asp 


Thr 


Gin 


Lys 


Tie Teu Val 


Asp 








340 






His 


lie 


TVt* 
xyir 


Ala 


lie \jj.y 


Val 






355 






360 




Tip 


Ala 

AX CL 


He 


Met AT a filv 
ncL rvm vjxy 


m y 




370 






375 




Glv 


Ser 


Ser 


Asp 


Leu Met Asn 


Tvr 


385 








390 




Thr 


Pro 


Leu 


Glu 


Tvr f?l v CVs 


Val 










405 




Ala 


Arcr 


His 


Glv 


Gin Glu His 


Val 








420 






Pro 


Leu 


Glu 


Phe 


Thr Val Ala 


Glv 






435 






440 


Lys 


Met 


Val 


Cys 


T,pti Ayef r?lti 


Pro 




450 






455 




Phe 




v 

uiy 


IT A \J 




Glu 


465 








470 




He 


Lys 


Cys 


Gly 


Ala Ser Tyr 


Ala 










485 




His 


Pro 


Thr 


Cys 


Ser Glu Glu 


Val 








500 






Ser 


Gly 


Leu 


Asp 


Pro Thr Val 


Thr 






515 




520 



315 320 



Lys 


Ala 


Gly Val Asp Thr Ser Pro 




330 


335 


Ser 


Arq 


Glu Ala Thr Ser Val Pro 


345 




350 


Val 


Glu 


Gly Arg Pro Glu Leu Thr 






365 


Leu 


Leu 


Val Gin Arg Leu Phe Gly 






380 


Asp 


Asn 


Val Pro Thr Thr Val Phe 






395 400 


Gly 


Leu 


Ser Glu Glu Glu Ala Val 




410 


415 


Glu 


Val 


Tyr His Ala His Tyr Lys 


425 




430 


Arg 


Asp 


Ala Ser Gin Cys Tyr Val 






445 


Pro 


Gin 


Leu Val Leu Gly Leu His 






460 


Val 


Thr 


Gin Gly Phe Ala Leu Gly 






475 480 


Gin 


Val 


Met Arg Thr Val Gly He 




490 


495 


Val 


Lys 


Leu Arg He Ser Lys- Arg 


505 




510 


Gly 


Cys 


Xaa Gly 



<210> 296 

<211> 577 

<212> PRT 

<213> Homo sapien 

<220> 

<221> VARIANT 

<222> 576 

<223> Xaa = Any Amino Acid 



<400> 296 



Arg 


Val 


Val 


He 


Phe 


Ser 


Lys 


Ser 


Tyr 


Cys 


Pro His Ser Thr Arg Val 


1 








5 






10 


15 


Lys 


Glu 


Leu 


Phe 


^Ser 


Ser 


Leu 


Gly 


Val 


Glu 


Cys Asn Val Leu Glu Leu 






20 










25 




30 


Asp 


Gin 


Val 


Asp 


Asp 


Gly 


Ala 


Arg 


Val 


Gin 


Glu Val Leu Ser Glu He 






35 










40 






45 


Thr 


Asn 


Gin 


Lys 


Thr 


Val 


Pro 


Asn 


He 


Phe 


Val Asn Lys Val His Val 




50 








55 








60 


Gly 


Gly 


Cys 


Asp 


Gin 


Thr 


Phe 


Gin 


Ala 


Tyr 


Gin Ser Gly Leu Leu Gin 


65 










70 










75 80 


Lys 


Leu 


Leu 


Gin 


Glu 


Asp 


Leu 


Ala 


Tyr 


Asp 


Tyr Asp Leu He He lie 










85 










90 


95 


Gly 


Gly 


Gly 


Ser 


Gly 


Gly 


Leu 


Ser 


Cys 


Ala 


Lys Glu Ala Ala He Leu 








100 










105 




110 


Gly 


Lys 


Lys 


Val 


Met 


Val 


Leu 


Asp 


Phe 


Val 


Val Pro Ser Pro Gin Gly 






115 










120 






125 


Thr 


Ser 


Trp 


Gly 


Leu 


Gly 


Gly 


Thr 


Cys 


Val 


Asn Val Gly Cys He Pro 




130 










135 








140 


Lys 


Lys 


Leu 


Met 


His 


Gin 


Ala 


Ala 


Leu 


Leu 


Gly Gin Ala Leu Cys Asp 


145 










150 










155 160 


Ser 


Arg 


Lys 


Phe 


Gly 


Trp 


Glu 


Tyr 


Asn 


Gin 


Gin Val Arg His Asn Trp 










165 










170 


175 


Glu 


Thr 


Met 


Thr 


Lys 


Ala 


He 


Gin 


Asn 


His 


He Ser Ser Leu Asn Trp 








180 










185 




190 


Gly 


Tyr 


Arg 


Leu 


Ser 


Leu 


Arg 


Glu 


Lys 


Ala 


Val Ala Tyr Val Asn Ser 




195 










200 






205 


Tyr 


Gly 


Glu 


Phe 


Val 


Glu 


His 


His 


Lys 


He 


Lys Ala Thr Asn Lys Lys 
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210 








215 


220 


Gly 


Gin 


Glu 


Thr 


Tyr 


Tyr Thr Ala Ala Gin Phe 


Val He Ala Thr Gly 


225 










230 235 


240 


Glu 


Ara 


Pro 


Ara 


Tyr 


Leu Gly He Gin Gly Asp 


Lys Glu Tyr Cys He 










245 


250 


255 


Thr 


Ser 


Asp 


Asp 


Leu 


Phe Ser Leu Pro Tyr Cys 


Pro Gly Lys Pro Leu 








260 




265 


270 


Val 


Val 


Gly 


Ala 


Ser 


Tyr Val Ala Leu Glu Cys 


Ala Gly Phe Leu Ala 






275 






280 


285 


Gly 


Phe 


Gly 


Leu 


Asp 


Val Thr Val Met Val Arg 


Ser He Leu Leu Arg 




290 








295 


300 


Gly 


Phe 


Asp 


Gin 


Glu 


Met Ala Glu Lys Val Gly 


Ser Tyr Met Glu Gin 


305 










310 315 


320 


His 


Gly 


Val 


Lvs 


Phe 


Leu Arg Lys Phe He Pro 


Val Met Val Gin Gin 










325 


330 


335 


Leu 


Glu 


Lvs 


Glv 


Ser 


Pro Gly Lys Leu Lys Val 


Leu Ala Lys Ser Thr 








340 




345 


350 


Glu 


Gly 


Thr 


Glu 


Thr 


He Glu Gly Val Tyr Asn 


Thr Val Leu Leu Ala 






355 






3 60 


365 


He 


Gly 


Ara 


Asp 


Ser 


Cys Thr Arg Lys He Gly 


Leu Glu Lys He Gly 




370 








375 


380 


Val 


Lys 


He 


Asn 


Glu 


Lys Ser Gly Lys He Pro 


Val Asn Asp Val Glu 


385 










390 "* " 395 


400 


Gin 


Thr 


Asn 


Val 


Pro 


Tyr Val Tyr Ala Val Gly 


Asp He Leu Glu- Asp 










405 


410 


415 


Lvs 


Pro 


Glu 


Leu 


Thr 


Pro Val Ala He* Gin Ser 


Gly Lys Leu Leu Ala 








420 




425 


430 


Gin 


Ara 


Leu 


Phe 


Glv 


Ala Ser Leu Glu Lys Cys 


Asp Tyr He Asn Val 






435 






440 


445 


Pro 


Thr 


Thr 


Val 


Phe 


Thr Pro Leu Glu Tyr Gly 


Cvs Cvs Glv Leu Ser 




450 








455 


460 


Glu 


Glu 


Lys 


Ala 


He 


Glu Val Tyr Lys Lys Glu 


Asn Leu Glu He Tyr 


465 










470 A ~ 475 


480 


His 


Thr 


Leu 


Phe 


Tro 


Pro Leu Glu Trp Thr Val 


Ala Gly Arg Glu Asn 










485 


490 


495 


Asn 


Thr 


Cys 


Tyr 


Ala 


Lys He He Cys Asn Lys 


Phe Asp His Asp Arg 








500 




505 


510 


Val 


He 


Gly Phe 


His 


He Leu Gly Pro Asn Ala 


Gly Glu Val Thr Gin 






515 






520 


525 


Gly 


Phe 


Ala 


Ala 


Ala 


Met Lys Cys Gly Leu Thr 


Lys Gin Leu Leu Asp 




530 








535 


540 


Asp 


Thr 


He Gly 


He 


His Pro Thr Cys Gly Glu 


Val Phe Thr Thr Leu 


545 










550 " 555 


560 


Glu 


He 


Thr Lys 


Ser 


Ser Gly Leu Asp He Thr 


Gin Lys Gly Cys Xaa 










565 


570 


575 


Gly 















<210> 297 
<211> 494 
<212> PRT 
<213> Homo sapien 



<400> 297 



Met Glu Asp 


Gin 


Ala 


Gly Gin Arg 


1 




5 




Gly Gly Ser 


Gly 


Gly 


Leu Ala Cys 




20 






Arg Lys Val 


Ala 


Val 


Val Asp Tyr 


35 






40 


Arg Trp Gly 


Leu 


Gly 


Gly Thr Cys 


50 






55 


Lys Leu Met 


His 


Gin 


Ala Ala Leu 


65 






70 


Pro Asn Tyr 


Gly 


Trp 


Glu Val Ala 






85 





Asp Tyr Asp Leu Leu Val' Val Gly 

10 15 
Ala Lys Glu Ala Ala Gin Leu Gly 
25 30 
Val Glu Pro Ser Pro Gin Gly Thr 
45 

Val Asn Val Gly Cys He Pro Lys 
60 

Leu Gly Gly Leu He Gin Asp Ala 

75 80 
Gin Pro Val Pro His Asp Trp Arg 
90 95 
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Lys Met Ala Glu Ala Val Gin Asn His Val Lys Ser Leu Asn Trp Gly 

100 105 110 

His Arg Val Gin Leu Gin Asp Arg Lys Val Lys Tyr Phe Asn lie Lys 

115 120 125 

Ala Ser Phe Val Asp Glu His Thr Val Cys Gly Val Ala Lys Gly Gly 

130 135 140 

Lys Glu lie Leu Leu Ser Ala Asp His lie lie lie Ala Thr Gly Gly 
145 150 155 160 

Arg Pro Arg Tyr Pro Thr His lie Glu Gly Ala Leu Glu Tyr Gly lie 

165 170 175 

Thr Ser Asp Asp He Phe Trp Leu Lys Glu Ser Pro Gly Lys Thr Leu 

180 185 190 

Val Val Gly Ala Ser Tyr Val Ala Leu Glu Cys Ala Gly Phe Leu Thr 

195 200 " 205 

Gly He Gly Leu Asp Thr Thr He Met Met Arg Ser. He Pro Leu Arg 

210 215 220 

Gly Phe Asp Gin Gin Met Ser Ser Met Val He Glu His Met Ala Ser 
225 230 235 240 

His Gly Thr Arg Phe Leu Arg Gly Cys Ala Pro Ser Arg Val Arg Arg 

245 " " 250 ~ 255 

Leu Pro Asp Gly Gin Leu Gin Val Thr Trp Glu Asp Ser Thr Thr Gly 

260 265 270 

Lys Glu Asp Thr Gly Thr Phe Asp Thr Val Leu Trp Ala He Gly Arg 

275 280 285 

Val Pro Asp Thr Arg Ser Leu Asn Leu Glu Lys Ala Gly Val Asp Thr 

290 295 300 

Ser Pro Asp Thr Gin Lys He Leu Val Asp Ser Arg Glu Ala Thr Ser 
305 310 315 ~ 320 

Val Pro His He Tyr Ala He Gly Asp Val Val Glu Gly Arg Pro Glu 

325 330 335 

Leu Thr Pro Thr Ala He Met Ala Gly Arg Leu Leu Val Gin Arg Leu 

340 345 350 

Phe Gly Gly Ser Ser Asp Leu Met Asp Tyr Asp Asn Val Pro Thr Thr 

355 360 365 

Val Phe Thr Pro Leu Glu Tyr Gly Cys Val Gly Leu Ser Glu Glu Glu 

370 375 380 

Ala Val Ala Arg His Gly Gin Glu His Val Glu Val Tyr His Ala His 
385 390 395 400 

Tyr Lys Pro Leu Glu Phe Thr Val Ala Gly Arg Asp Ala Ser Gin Cys 

405 410 415 

Tyr Val Lys Met Val Cys Leu Arg Glu Pro Pro Gin Leu Val Leu Gly 

420 425 430 

Leu His Phe Leu Gly Pro Asn Ala Gly Glu Val Thr Gin Gly Phe Ala 

435 440 - 445 

Leu Gly He Lys Cys Gly Ala Ser Tyr Ala Gin Val Met Arg Thr Val 

450 455 460 

Gly He His Pro Thr Cys Ser Glu Glu Val Val Lys Leu Arg He Ser 
465 470 475 480 

Lys Arg Ser Gly Leu Asp Pro Thr Val Thr Gly Cys Cys Gly 
485 ~ 490 



<210> 298 
<211> 521 
<212> PRT 
<213> Homo sapien 

<400> 298 

Met Ala Ala Met Ala Val Ala Leu Arg Gly Leu Gly Gly Arg Phe Arg 

15 10 15 

Trp Arg Thr Gin Ala Val Ala Gly Gly Val Arg Gly Ala Ala Arg Gly 

20 25 30 

Ala Ala Gly Gin Arg Asp Tyr Asp Leu Leu Val Val Gly Gly Gly Ser 

35 40 45 

Gly Gly Leu Ala Cys Ala Lys Glu Ala Ala Gin Leu Gly Arg Lys Val 

50 55 60 

Ser Val Val Asp Tyr Val Glu Pro Ser Pro Gin Gly Thr Arg Trp Gly 
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65 70 75 

Leu Gly Gly Thr Cys Val Asn Val Gly Cys lie 

85 9.0 
His Gin Ala Ala Leu Leu Gly Gly Leu He Gin 

100 105 
Gly Trp Glu Val Ala Gin Pro Val Pro His Asp 

115 120 
Glu Ala Val Gin Asn His Val Lys Ser Leu Asn 

130 135 
Gin Leu Gin Asp Arg Lys Val Lys Tyr Phe Asn 
145 150 155 

Val Asp Glu His Thr Val Cys Gly Val Ala Lys 

165 170 
Leu Leu Ser Ala Asp His He He He Ala Thr 

180 185 
Tyr Pro Thr His He Glu Gly Ala Leu Glu Tyr 

195 200 
Asp He Phe Trp Leu Lys Glu Ser Pro Gly Lys 

210 215 
Ala Ser Tyr Val Ala Leu Glu Cys Ala Gly Phe 
225 230 235 

Leu Asp Thr Thr He Met Met Arg Ser He Pro 

245 250 
Gin Gin Met Ser Ser Met Val He Glu His Met 

260 265 
Arg Phe Leu Arg Gly Cys Ala Pro Ser Arg Val 

275 280 
Gly Gin Leu Gin Val Thr Trp Glu Asp Ser Thr 

290 295 
Thr Gly Thr Phe Asp Thr Val Leu Trp Ala He 
305 310 315 

Thr Arg Ser Leu Asn Leu Glu Lys Ala Gly Val 

325 330 
Thr Gin Lys lie Leu Val Asp Ser Arg Glu Ala 

340 345 
He Tyr Ala He Gly Asp Val Val Glu Gly Arg 

355 360 
Thr Ala He Met Ala Gly Arg Leu Leu Val Gin 

370 375 
Ser Ser Asp Leu Met Asp Tyr Asp Asn Val Pro 
385 390 395 

Pro Leu Glu Tyr Gly Cys Val Gly Leu Ser Glu 

405 410 
Arg His Gly Gin Glu His Val Glu Val Tyr His 

420 425 
Leu Glu Phe Thr Val Ala Gly Arg Asp Ala Ser 

435 * 440 

Met Val Cys Leu Arg Glu Pro Pro Gin Leu Val 

450 455 
Leu Gly Pro Asn Ala Gly Glu Val Thr Gin Gly 
465 470 475 

Lys Cys Gly Ala Ser Tyr Ala Gin Val Met Arg 

485 490 
Pro Thr Cys Ser Glu Glu Val Val Lys Leu Arg 

500 505 
Gly Leu Asp Pro Thr Val Thr Gly Cys 
515 520 



80 

Pro Lys Lys Leu Met 
95 

Asp Ala Pro Asn Tyr 
110 

Trp Arg Lys Met Ala 
125 

Trp Gly His Arg Val 
140 

He Lys Ala Ser Phe 
160 

Gly Gly Lys Glu He 
175 

Gly Gly Arg Pro Arg 
190 

Gly He Thr Ser Asp 
205 

Thr Leu Val Val Gly 
220 

Leu Thr Gly He Gly 
240 

Leu Arg Gly Phe Asp 
255 

Ala Ser His Gly -Thr 
270 

Lys Arg Leu Pro Asp 
285 

Thr Gly Lys Glu Asp 
300 

Gly Arg Val Pro Asp 
320 

Asp Thr Ser Pro Asp 
335 

Thr Ser Val Pro His 
350 

Pro Glu Leu Thr Pro 
365 

Arg Leu Phe Gly Gly 
380 

Thr Thr Val Phe Thr 
400 

Glu Glu Ala Val Ala 
415 

Ala His Tyr Lys Pro 
430 

Gin Cys Tyr Val Lys 
445 

Leu Gly Leu His Phe 
460 

Phe Ala Leu Gly He 
480 

Thr Val Gly He His 
495 

He Ser Lys Arg Ser 
510 



<210> 299 
<211> 549 
<212> PRT 
<213> Homo sapien 

<400> 299 

Met Ser Cys Glu Asp Gly Arg Ala Leu Glu Gly Thr Leu Ser Glu Leu 
1 5 10 15 
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Ala Ala 


Glu 


Thr 


Asp 


Leu 






20 




Gly Gly 


His 


Gly 


Pro 


Thr 




35 








Lys Leu 


Leu 


Lys 


Met 


Asn 


50 










Tyr Asp 


Leu 


He 


He 


He 


65 








70 


Lys Glu 


Ala 


Ala 


Gin 


Tyr 








85 




Thr Pro 


Thr 


Pro 


Leu 


Gly 






100 




Asn Val 


Gly 


Cys 


He 


Pro 




115 








Gly Gin 


Ala 


Leu 


Gin 


Asp 


130 










Thr Val 


Lys 


His 


Asp 


Trp 


145 








150 


lie Gly 


Ser 


Leu 


Asn 


Trp 








165 


Val Val 


Tyr 


Glu 


Asn 


Ala 




180 






Lys Ala 


Thr 


Asn 


Asn 


Lys 




195 








Phe Leu 


He 


Ala 


Thr 


Gly 


210 








Asp Lys 


Glu 


Tyr 


Cys 


He 


225 








230 


Cys Pro 


Gly 


Lys 


Thr 


Leu 








245 




Cys Ala 


Gly 


Phe 


Leu 


Ala 






260 






Arg Ser 


He 


Leu 


Leu 


Arg 


275 






Gly Glu 


His 


Met 


Glu 


Glu 


290 










Pro lie 


Lys 


Val 


Glu 


Gin 


305 






310 


Val Val 


Ala 


Gin 


Ser 


Thr 








325 




Asn Thr 


Val 


Met 


Leu 


Ala 






340 






Gly Leu 


Glu 


Thr 


Val 


Gly 




355 








Pro Val 


Thr 


Asp 


Glu 


Glu 


370 








Gly Asp 


He 


Leu 


Glu 


Asp 


385 








390 


Ala Gly 


Arg 


Leu 


Leu 


Ala 








405 




Cys Asp 


Tyr 


Glu 


Asn 


Val 






420 






Gly Ala 


Cys 


Gly 


Leu 


Ser 




435 








Glu Asn 


He 


Glu 


Val 


Tyr 


450 










He Pro 


Ser 


Arg 


Asp 


Asn 


465 








470 


Thr Lys 


Asp 


Asn 


Glu 


Arg 








485 




Ala Gly 


Glu 


Val 


Thr 


Gin 






500 






Thr Lys 


Lys 


Gin 


Leu 


Asp 




515 








Glu Val 


Phe 


Thr 


Thr 


Leu 


530 










Leu Gin 


Ala 


Gly 


Cys 





Pro 


Val 


Val 


Phe Val 


Lys 






25 






Leu 


Lys 


Ala 


Tyr Gin 


Glu 




40 








Gly 


Pro 


Glu 


Asp Leu 


Pro 


55 








60 


Gly 


Gly 


Gly 


Ser Gly 


Gly 








75 




Gly 


Lys 


Lys 


Val Met 


Val 








90 




Thr 


Arg 


Trp 


Gly Leu 


Gly 






105 






Lys 


Lys 


Leu 


Met His 


Gin 




120 








Ser 


Arg 


Asn 


Tyr Gly 


Trp 


135 








140 


Asp 


Arg 


Met 


He Glu 


Ala 






155 




Gly 


Tyr 


Arg 


Val Ala 


Leu 








170 




Tyr 


Gly 


Gin 


Phe He 


Gly 






185 






Gly 


Lys 


Glu 


Lys He 


Tyr 




200 








Glu 


Arg 


Pro 


Arg Tyr 


Leu 


215 








220 


Ser 


Ser 


Asp 


Asp Leu 


Phe 








235 




Val 


Val 


Gly 


Ma Ser 


Tyr 








250 




Gly 


He 


Gly 


Leu Asp 


Val 






265 






Gly 


Phe 


Asp 


Gin Asp 


Met 




280 








His 


Gly 


He 


Lys Phe 


He 


295 






300 


He 


Glu 


Ala 


Gly Thr 


Pro 








315 




Asn 


Ser 


Glu 


Glu He 


He 








330 




lie 


Gly 


Arg 


Asp Ala 


Cys 






345 






Val 


Lys 


He 


Asn Glu 


Lys 




360 








Gin 


Thr 


Asn 


Val Pro 


Tyr 


375 








380 


Lys 


Val 


Glu 


Leu Thr 


Pro 






395 




Gin 


Arg 


Leu 


Tyr Ala 


Gly 








410 




Pro 


Thr 


Thr 


Val Phe 


Thr 






425 






Glu 


Glu 


Lys 


Ala Val 


Glu 




440 








His 


Ser 


Tyr 


Phe Trp 


Pro 


455 








460 


Asn 


Lys 


Cys 


Tyr Ala 


Lys 








475 




Val 


Val 


Gly 


Phe His 


Val 






490 




Gly 


Phe 


Ala 


Ala Ala 


Leu 






505 






Ser 


Thr 


He 


Gly He 


His 




520 






Ser 


Val 


Thr 


Lys Arg 


Ser 


535 






540 
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Gin Arg 


Lys 


He 


30 






Gly Arg 


Leu 


Gin 


45 






Lys Ser 


Tyr 


Asp 


Leu Ala 


Ala 


Ala 






80 






Val 
Val 




95 




Gly Thr 


Cys 


Val 


110 






Ala Ala 


Leu 


Leu 


125 






Lys Val 


Glu 


Glu 


Val Gin 


Asn 


His 






160 




jj y o 






175 




Pro His 


Arg 


He 


190 






Ser Ala 


Glu 


Arg 


205 






Gly He 


Pro 


Gly 


Ser Leu 


Pro 


Tyr 






240 


Val ala 


Leu 


m ii 

bill 




255 




Thr Val 


Met 


Val 


270 






Ala Asn 


Lys 


He 


285 






Arg Gin 


Phe 


Val 


Gly Arg 


Leu 


Arg 






320 


vjIU vjXy 


uJ-U 


iyr 




335 




Thr Arg 


Lys 


He 


350 






Thr Gly 


Lys 


He 


365 






He Tyr 


Ala 


He 


Val Ala 


He 


Gin 






400 


oer xnr 


vai 


Lys 




415 


Pro Leu 


Glu 


Tyr 


430 






Lys Phe 


Gly 


Glu 


445 






Leu Glu 


Trp 


Thr 


He He 


Cys 


Asn 




480 


Leu Gly 


Pro 


Asn 




495 




Lys Cys 


Gly 


Leu 


510 






Pro Val 


Cys 


Ala 


525 




Gly Ala 


Ser 


He 
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545 , 



<210> 300 
<211> 613 
<212> PRT 

<213> Mus musculus 
<220> 

<221> VARIANT 
<222> 612 

<223> Xaa = Any Amino Acid 



<400> 300 



Met Pro 


Val 


Asp 


Asp 


Cys 


Trp Leu Tyr 


Phe Pro Ala Ser Arg Gly Arg 


1 






5 






10 15 


Thr Phe 


Val 


Gin 


Thr 


Val 


Trp Val Ala 


Pro Thr Cys Pro Asn Cys Cys 






20 






25 


30 


Trp Phe 


Pro 


Gly 


Phe 


Leu 


Pro Pro Val 


Pro Arg Pro Pro His Val Pro 


35 






40 


45 


Arg Val 


Leu 


Leu 


Arg 


Gly 


Pro Arg Gly 


Ala Val Leu Pro Ala Ser Arg 


50 










55 


60 


Pro Ser 


Lys 


Thr 


Leu 


Pro 


Ser Ser Ser 


Gin Thr Pro Cys Pro Thr Asp 


65 








70 




75 . 80 


Pro Cys 


He 


Cys 


Pro 


Pro 


Pro Ser Thr 


Pro Asp Ser Arg Gin Glu Lys 




85 






90 95 


Asn Thr 


Gin 


Ser 


Glu 


Leu 


Pro Asn Lys 


Lys Gly Gin Leu Gin Lys Leu 






100 






105 


110 


Pro Thr 


Met 


Asn 


Gly 


Ser 


Lys Asp Pro 


Pro Gly Ser Tyr Asp Phe Asp 




115 








120 


125 


Leu lie 


He 


He 


Gly 


Gly 


Gly Ser Gly 


Gly Leu Ala Ala Ala Lys Glu 


130 










135 


140 


Ala Ala 


Lys 


Phe 


Asp 


Lys 


Lys Val Leu 


Val Leu Asp Phe Val Thr Pro 


145 








150 




155 160 


Thr Pro 


Leu 


Gly 


Thr 


Arg 


Trp Gly Leu 


Gly Gly Thr Cys Val Asn Val 








165 






170 175 


Gly Cys 


He 


Pro 


Lys 


Lys 


Leu Met His 


Gin Ala Ala Leu Leu Gly Gin 






180 






185 


190 


Ala Leu 


Lys 


Asp 


Ser 


Arg 


Asn Tyr Gly 


Trp Lys Val Glu Asp Thr Val 




195 








200 


205 


Lys His 


Asp 


Trp 


Glu 


Lys 


Met Thr Glu 


Ser Val Gin Ser His He Gly 


210 






215 


220 


Ser Leu 


Asn 


Trp 


Gly 


Tyr 


Arg Val Ala 


Leu Arg Glu Lys Lys Val Val 


225 








230 




235 240 


Tyr Glu 


Asn 


Ala 


Tyr 


Gly 


Arg Phe He 


Gly Pro His Arg He Val Ala 








245 






250 255 


Thr Asn 


Asn 


Lys 


Gly 


Lys 


Glu Lys He 


Tyr Ser Ala Glu Arg Phe Leu 






260 






265 


270 


lie Ala 


Thr 


Gly 


Glu 


Arg 


Pro Arg Tyr 


Leu Gly He Pro Gly Asp Lys 




275 








280 


285 


Glu Tyr 


Cys 


He 


Ser 


Ser 


Asp Asp Leu 


Phe Ser Leu Pro Tyr Cys Pro 


290 










295 


300 


Gly Lys 


Thr 


Leu 


Val 


Val 


Gly Ala Ser 


Tyr Val Ala Leu Glu Cys Ala 


305 








310 




315 320 


Gly Phe 


Leu 


Ala 


Gly 


He 


Gly Leu Asp 


Val Thr Val Met Val Arg Ser 








325 






330 , 335 


lie Leu 


Leu 


Arg 


Gly 


Phe 


Asp Gin Asp 


Met Ala Asn Lys He Gly Glu 






340 






345 


350 


His Met 


Glu 


Glu 


His 


Gly 


He Lys Phe 


He Arg Gin Phe Val Pro Thr 




355 






360 


365 


Lys lie 


Glu 


Gin 


He 


Glu 


Ala Gly Thr 


Pro Gly Arg Leu Arg Val Thr 


370 










375 


380 


Ala Gin 


Ser 


Thr 


Asn 


Ser 


Glu Glu Thr 


He Glu Gly Glu Phe Asn Thr 


385 








390 




395 400 


Val Leu 


Leu 


Ala 


Val 


Gly 


Arg Asp Ser 


Cys Thr Arg Thr He Gly Leu 








405 






410 415 


Glu Thr 


Val 


Gly 


Val 


Lys 


He Asn Glu 


Lys Thr Gly Lys He Pro Val 
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420 



Thr 


Asp 


Glu 


Glu 


Gin Thr 


Asn Val 




435 








440 


He 


Leu 


Glu 


Gly 


Lys Leu 


Glu Leu 




450 










455 


Arg 


Leu 


Leu 


Ala 


Gin Arg 


Leu Tyr 


465 










470 




Tyr 


Asp 


Asn 


Val 


Pro 


Thr 


Thr Val 










485 






Cys 


Gly 


Leu 


Ser 


Glu 


Glu 


Lys Ala 








500 








He 


Glu 


Val 


Tyr 


His 


Ser 


Phe Phe 






515 








520 


Ser 


Arg 


Asp 


Asn 


Asn 


Lys 


Cys Tyr 




530 










535 


Asp 


Asp 


Glu 


Arg 


Val 


Val 


Gly Phe 


545 










550 




Glu 


Val 


Thr 


Gin 


Gly 


Phe 


Ala Ala 










565 






Gin 


Gin 


Leu 


Asp 


Ser 


Thr 


He Gly 








580 








Phe 


Thr 


Thr 


Leu 


Ser 


Val 


Thr Lys 






595 








600 


Ser 


Gly 


Cys 


Xaa 


Gly 







610 



<210> 301 

<211> 310 

<212> PRT 

<213> Mus musculus 



<400> 301 



Met 


Asn 


Gly 


Ser 


Lys Asp 
5 


Pro Pro 


1 
He 


He 


Gly 


Gly 


Gly Ser 


Gly Gly 








20 






Lys 


Phe 


Asp 


Lys 


Lys Val 


Leu Val 






35 






40 


Leu 


Gly 


Thr 


Arg 


Trp Gly 


Leu Gly 




50 








55 


He 


Pro 


Lys 


Lys 


Leu Met 


His Gin 


65 






70 




Lys 


Asp 


Ser 


Arg 


Asn Tyr 


Gly Trp 










85 




Asp 


Trp 


Glu 


Lys 


Met Thr 


Glu Ser 








100 






Asn 


Trp 


Gly 


Tyr 


Arg Val 


Ala Leu 






115 






120 


Asn 


Ala 


Tyr 


Gly 


Arg Phe 


He Gly 




130 








135 


Asn 


Lys 


Gly 


Lys 


Glu Lys 


He Tyr 


145 








150 




Thr 


Gly 


Glu 


Arg 


Pro Arg 


Tyr Leu 










165 




Cys 


He 


Ser 


Ser 


Asp Asp 


Leu Phe 








180 






Thr 


Leu 


Val 


Val 


Gly Ala 


Ser Tyr 






195 






200 


Leu 


Ala 


Gly 


He 


Gly Leu 


Asp Val 




210 








215 


Leu 


Arg 


Gly 


Phe 


Asp Gin 


Asp Met 


225 








230 




Glu" 


Glu 


His 


Gly 


He Lys 


Phe He 








245 




Glu 


Gin 


He 


Glu 


Ala Gly 


Thr Pro 



260 



425 430 



Pro 


Tyr 


lie 


Tyr 


Ala He Gly Asp 










A A C * 


Thr 


Pro 


val 


TV 1 ~ 

Ala 


lie Gin Ala Gly 








460 


Orly 


Gly 


Ser 


Asn 


Val Lys Cys Asp 






A H C 
4 JO 




480 


Pne 


Thr 


Pro 


Leu 


Glu Tyr Gly Cys 




a on 






495 


vai 


GlU 


Lys 


Pne 


Gly Glu Glu Asn 


bub 








blU 


Trp 


Pro 


lieu 


f*l mn 
GlU 


Trp Thr Val Pro 










c o c 
52b 


A±a 


Lys 


lie 


lie 


Cys Asn Leu Lys 












His 


Val 


Leu 


Gly 


Pro Asn Ala Gly 






555 




560 


Ala 


Leu 


Lys 


Cys 


Gly Leu Thr Lys 




570 






575 


He 


His 


Pro 


Val 


Cys Ala Glu He 


585 








590 


Arg 


Ser 


Gly 


Gly 


Asp He Leu Gin 










605 


Gly 


Ser 


Tyr 


Asp 


Phe Asp Leu He 




10 






15 


Leu 


Ala 


Ala 


Ala 


Lys Glu Ala Ala 


25 








3 0 


Leu 


Asp 


Phe 


Val 


Thr Pro Thr Pro 








45 


Gly 


Thr 


Cys 


Val 


Asn Val Gly Cys 








a a 
ou 




Ala 


Ala 


Leu 


Leu 


Gly Gin Ala Leu 






75 




80 


Lys 


Val 


Glu 


Asp 


Thr Val Lys Hxs 




90 






95 


Val 


Gin 


Ser 


His 


He Gly Ser Leu 


105 








110 


Arg 


Glu 


Lys 


Lys 


Val Val Tyr Glu 










125 


Pro 


Hxs 


Arg 


He 


Val Ala Thr Asn 








140 




Ser 


Ala 


Glu 


Arg 


Pne Leu lie Ala 






155 




160 


Gly 


Tl 

lie 


Pro 


Gly 


Asp Lys Glu Tyr 




1 /U 






1 /D 


Ser 


Leu 


Pro 


Tyr 


Cys Pro, Gly Lys 


185 








iyu 


Val 


Ala 


Leu 


Glu 


Cys Ala Gly Pne 












Thr 


Val 


Met 


Val 


Arg Ser He Leu 








220 




Ala 


Asn 


Lys 


He 


Gly Glu His Met 






235 




240 


Arg 


Gin 


Phe 


Val 


Pro Thr Lys He 


250 






255 


Gly 


Arg 


Leu 


Arg 


Val Thr Ala Gin 



265 270 
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Ser Thr Asn Ser Glu 


Glu 


Thr 


He 


Glu 


Gly 


Glu 


Phe 


Asn 


Thr 


Val 


Leu 


275 






280 








285 








Leu Ala Val Gly Arg 


Asp 


Ser 


Cys 


Thr 


Arg 


Thr 


He 


Gly 


Leu 


Glu 


Thr 


290 




295 










300 










Val Gly Val Lys lie 


Asn 






















305 


310 























<210> 302 
<211> 613 
<212> PRT 
<213> Mus musculus 

<400> 302 



Met 


Ser 


Ser 


Pro 


Pro 


Gly 


Arg 


Arg 


Ala 


Arg 


Leu 


Ala 


Ser 


Pro 


HI mm 

Gly 


Tnr 


1 








5 










10 










15 




Ser 


Arg 


Pro 


Ser 


Ser 


Glu 


Ala 


Arg 


Glu 


Glu 


Leu 


Arg 


Arg 


Arg 


Leu 


Arg 








20 








25 










30 






Asp 


Leu 


lie 


Glu 


Gly 


Asn 


Arg 


Val 


Met 


He 


Pne 


Ser 


Lys 


Ser 


Tyr 


rim mm, 

cys 






35 










40 










45 








Pro 


His 


Ser 


Thr 


Arg 


Val 


Lys 


Glu 


Leu 


Phe 


Ser 


Ser 


Leu 


Gly 


Val 


Val 




50 








55 










60 










Tyr 


Asn 


He 


Leu 


Glu 


Leu 


Asp 


Gin 


Val 


Asp 


Asp 


Gly 


TV T _ 

Ala 


Ser 


Val 


m mm 

Gin 


65 










70 








75 










80 


Glu 


Val 


Leu 


Thr 


Glu 


He 


Ser 


Asn 


Gin 


Lys 


Thr 


Val 


Pro 


Asn 


lie 


Phe 










85 










90 










95 




Val 


Asn 


Lys 


Val 


His 


Val 


Gly 


Gly 


Cys 


Asp 


Arg 


Thr 


Phe 


Gin 


Ala 


His 








100 










105 










110 






Gin 


Asn 


Gly 


Leu 


Leu 


Gin 


Lys 


Leu 


Leu 


Gin 


Asp 


Asp 


Ser 


Ala 


His 


Asp 






115 










120 










125 








Tyr 


Asp 


Leu 


He 


He 


He 


Gly 


Gly 


Gly 


Ser 


Gly 


Gly 


Leu 


Ser 


Cys 


Ala 




130 










135 










140 










Lys 


Glu 


Ala 


Ala 


Asn 


Leu 


Gly 


Lys 


Lys 


Val 


Met 


Val 


Leu 


Asp 


Phe 


Val 


145 










150 










155 










160 


Val 


Pro 


Ser 


Pro 


Gin 


Gly 


Thr 


Thr 


Trp 


Gly 


Leu 


Gly 


Gly 


Thr 


Cys 


Val 










165 










170 










175 




Asn 


Val 


Gly 


Cys 


He 


Pro 


Lys 


Lys 


Leu 


Met 


His 


Gin 


Ala 


Ala 


Leu 


Leu 






180 






185 










190 






Gly 


His 


Ala 


Leu 


Gin 


Asp 


Ala 


Lys 


Lys 


Tyr 


Gly 


Trp 


Glu 


Tyr 


Asn 


Gin 






195 










200 










205 








Gin 


Val 


Lys 


His 


Asn 


Trp 


Glu 


Ala 


Met 


Thr 


Glu 


Ala 


He 


Gin 


Ser 


His 




210 






215 










220 










He 


Gly 


Ser 


Leu 


Asn 


Trp 


Gly 


Tyr 


Arg 


Val 


Thr 


Leu 


Arg 


Glu 


Lys 


Gly 


225 










230 










235 










240 


Val 


Thr 


Tyr 


Val 


Asn 


Ser 


Pne 


Gly 


Glu 


Phe 


Val 


TV ■ ,- 

Asp 


Leu 


HIS 


Lys 


lie 








245 








250 








255 




Lys 


A-L a 


JlIIJL 


Asn 


Lys 


Lys 






V31U 


X ILL. 






X All. 




Oca-*-* 
OCX. 


T,vn 








260 










265 










270 






Phe 


Val 


He 


Ala 


Thr 


Gly 


Glu 


Arg 


Pro 


Arg 


Tyr 


Leu 


Gly 


He 


Gin 


Gly 






275 








280 










285 








Asp 


Lys 


Glu 


Tyr 


Cys 


He 


Thr 


Ser 


Asp 


Asp 


Leu 


Phe 


Ser 


Leu 


Pro 


Tyr 




290 










295 










300 










Cys 


Pro 


Gly 


Cys 


Thr 


Leu 


Val 


Val 


Gly 


Ala 


Ser 


Tyr 


Val 


Gly 


Leu 


Glu 


305 










310 










315 










320 


Cys 


Ala 


Gly 


Phe 


Leu 


Ala 


Gly 


Leu 


Gly 


Leu 


Asp 


Val 


Thr 


Val 


Met 


Val 










325 










330 








Glu' 


335 




Arg 


Ser 


Val 


Leu 


Leu 


Arg 


Gly 


Phe 


Asp 


Gin 


Glu 


Met 


Ala 


Lys 


Val 






340 










345 










350 






Gly 


Ser 


Tyr 


Leu 


Glu 


Gin 


Gin 


Gly 


Val 


Lys 


Phe 


Gin 


Arg 


Lys 


Phe 


Thr 






355 










360 










365 








Pro 


He 


Leu 


Val 


Gin 


Gin 


Leu 


Glu 


Lys 


Gly 


Leu 


Pro 


Gly 


Lys 


Leu 


Lys 




370 










375 










380 










Val 


Val 


Ala 


Lys 


Ser 


Thr 


Glu 


Gly 


Pro 


Glu 


Thr 


Val 


Glu 


Gly 


He 


Tyr 


385 










390 










395 










400 


Asn 


Thr 


Val 


Leu 


Leu 


Ala 


He 


Gly 


Arg 


Asp 


Ser 


Cys 


Thr 


Arg 


Lys 


He 










405 










410 










415 




Gly 


Leu 


Glu 


Lys 


He 


Gly 


Val 


Lys 


He 


Asn 


Glu 


Lys 


Asn 


Gly 


Lys 


He 
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420 








425 430 


Pro 


Val 


Asn 


Asp 


Val 


Glu 


Gin 


Thr Asn Val Pro His Val Tyr Ala He 






435 










440 445 


Gly 


Asp 


He 


Leu 


Asp 


Gly 


Lys 


Pro Glu Leu Thr Pro Val Ala He Gin 




450 










455 


460 


Ala 


Gly 


Lys 


Leu 


Leu 


Ala 


Arg 


Arg Leu Phe Gly Val Ser Leu Glu Lys 


465 










470 




475 480 


Cys 


Asp 


Tyr 


He 


Asn 


lie 


Pro 


Thr Thr Val Phe Thr Pro Leu Glu Tyr 










485 






490 495 


Gly 


Cys 


Cys 


Gly 


Leu 


Ser 


Glu 


Glu Lys Ala He Glu Met Tyr Lys Lys 








500 








505 510 


Glu 


Asn 


Leu 


Glu 


Val 


Tyr 


His 


Thr Leu Phe Trp Pro Leu Glu Trp Thr 






515 










520 525 


Val 


Ala 


Gly 


Arg 


Asp 


Asn 


Asn 


Thr Cys Tyr Ala Lys He He Cys Asn 




530 










535 


540 


Lys 


Phe 


Asp 


Asn 


Glu 


Arg 


Val 


Val Gly Phe His Leu Leu Gly Pro Asn 


545 










550 




555 560 


Ala 


Gly 


Glu 


He 


Thr 


Gin 


Gly 


Phe Ala Ala Ala Met Lys Cys Gly Leu 










565 






570 575 


Thr 


Lys 


Gin 


Leu 


Leu 


Asp 


Asp 


Thr He Gly He His Pro Thr Cys Gly 








580 








585 590 


Glu 


Val 


Phe 


Thr 


Thr 


Leu 


Glu 


He Thr Lys Ser Ser Gly Leu Asp He 






595 










600 605 


Thr 


Gin 


Lys 


Gly 


Cys 










610 















<210> 303 
<211> 524 
<212> PRT 

<213> Mus musculus 
<220> 

<221> VARIANT 
<222> 523 

<223> Xaa = Any Amino Acid 
<400> 303 

Met Val Ala Ala Met Val Ala Ala Leu Arg Gly Pro Ser Arg Arg Phe 

1 5 10 15 

Arg Pro Arg Thr Arg Ala Leu Thr Arg Gly Thr Arg Gly Ala Ala Ser 

20 25 30 

Ala Ala Gly Gly Gin Gin Ser Phe Asp Leu Leu Val He Gly Gly Gly 

35 40 45 

Ser Gly Gly Leu Ala Cys Ala Lys Glu Ala Ala Gin Leu Gly Lys Lys 

50 55 60 

Val Ala Val Ala Asp Tyr Val Glu Pro Ser Pro Arg Gly Thr Lys Trp 
65 70 75 80 

Gly Leu Gly Gly Thr Cys Val Asn Val Gly Cys He Pro Lys Lys Leu 

85 90 95 

Met His Gin Ala Ala Leu Leu Gly Gly Met He Arg Asp Ala His His 

100 105 110 

Tyr Gly Trp Glu Val Ala Gin Pro Val Gin His Asn Trp Lys Thr Met 

115 120 125 

Ala Glu Ala Val Gin Asn His Val Lys Ser Leu Asn Trp Gly His Arg 

130 135 140 

Val Gin Leu Gin Asp Arg Lys Val Lys Tyr Phe Asn He Lys Ala Ser 
145 150 155 160 

Phe Val Asp Glu His Thr Val Arg Gly Val Asp Lys Gly Gly Lys Ala 

165 170 . 175 

Thr Leu. Leu Ser Ala Glu His He Val He Ala Thr Gly Gly Arg Pro 

180 185 190 

Arg Tyr Pro Thr Gin Val Lys Gly Ala Leu Glu Tyr Gly lie Thr Ser 

195 200 205 

Asp Asp He Phe Trp Leu Lys Glu Ser Pro Gly Lys Thr Leu Val Val 

210 215 220 

Gly Ala Ser Tyr Val Ala Leu Glu Cys Ala Gly Phe Leu Thr Gly He 
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225 










230 




235 240 


Gly 


Leu 


Asp 


Thr 


Thr 


Val 


Met Met 


Arg Ser He Pro Leu Arg Gly Phe 










245 






250 255 


Asp 


Gin 


Gin 


Met 


Ser 


Ser 


Leu Val 


Thr Glu His Met Glu Ser His Gly 








260 








265 270 


Thr 


Gin 


Phe 


Leu 


Lys 


Gly 


Cys Val 


Pro Ser His He Lys Lys Leu Pro 






275 








280 


285 


Thr 


Asn 


Gin 


Leu 


Gin 


Val 


Thr Trp 


Glu Asp His Ala Ser Gly Lys Glu 




290 










295 


300 


Asp 


Thr 


Gly 


Thr 


Phe 


Asp 


Thr Val 


Leu Trp Ala He Gly Arg Val Pro 


305 










310 




315 " ~ 320 


Glu 


Thr 


Arg 


Thr 


Leu 


Asn 


Leu Glu 


Lys Ala Gly He Ser Thr Asn Pro 










325 






330 335 


Lys 


Asn 


Gin 


Lys 


lie 


He 


Val Asp 


Ala Gin Glu Ala Thr Ser Val Pro 








340 






345 350 


His 


He 


Tyr 


Ala 


He 


Gly 


Asp Val 


Ala Glu Gly Arg Pro Glu Leu Thr 






355 








360 


365 


Pro 


Thr 


Ala 


He 


Lys 


Ala 


Gly Lys 


Leu Leu Ala Gin Arg Leu Phe Gly 




370 










375 


380 


Lys 


Ser 


Ser 


Thr 


Leu 


Met 


Asp Tyr 


Ser Asn Val Pro Thr Thr Val Phe 


385 










390 




395 400 


Thr 


Pro 


Leu 


Glu 


Tyr 


Gly 


Cys Val 


Gly Leu Ser Glu Glu Glu Ala Val 










405 






410 415 


Ala 


Leu 


His 


Gly 


Gin 


Glu 


His Val 


Glu Val Tyr His Ala Tyr Tyr. Lys 








420 








425 430 


Pro 


Leu 


Glu 


Phe 


Thr 


Val 


Ala Asp 


Arg Asp Ala Ser Gin Cys Tyr He 






435 








440 


445 


Lys 


Met 


Val 


Cys 


Met 


Arg 


Glu Pro 


Pro Gin Leu Val Leu Gly Leu His 




450 










455 


460 


Phe 


Leu 


Gly 


Pro 


Asn 


Ala 


Gly Glu 


Val Thr Gin Gly Phe Ala Leu Gly 


465 








470 


475 480 


He 


Lys 


Cys 


Gly 


Ala 


Ser 


Tyr Ala 


Gin Val Met Gin Thr Val Gly He 










485 






490 495 


His 


Pro 


Thr 


Cys 


Ser 


Glu 


Glu Val 


Val Lys Leu His He Ser Lys Arg 








500 








505 510 


Ser 


Gly 


Leu 


Glu 


Pro 


Thr 


Val Thr 


Gly Cys Xaa Gly 






515 








520 





<210> 304 
<211> 528 
<212> PRT 
<213> Mus musculus 

<220> 

<221> VARIANT 
<222> 527 

<223> Xaa = Any Amino Acid 



<400> 304 



Met 


Ala 


Ala 


Met 


Val 


Ala 


Gly Arg 


Met Trp Ala Ala Leu Arg Gly Pro 


1 








5 






10 * 15 


Ser 


Arg 


Arg 


Phe 


Arg 


Pro 


Arg Thr 


Arg Ala Leu Thr Arg Gly Thr Arg 








20 








25 30 


Gly 


Ala 


Ala 


Ser 


Ala 


Ala 


Gly Gly 


Gin Gin Ser Phe Asp Leu Leu Val 






35 








40 


45 


He 


Gly 


Gly 


Gly Ser 


Gly 


Gly Leu 


Ala Cys Ala Lys Glu Ala Ala Gin 




50 










55 


60 


Leu 


Gly 


Lys 


Lys 


Val 


Ala 


Val Ala 


Asp Tyr Val Glu Pro Ser Pro Arg 


65 










70 




75 80 


Gly 


Thr 


Lys 


Trp Gly 


Leu 


Gly Gly 


Thr Cys Val Asn Val Gly Cys He 










85 






90 95 


Pro 


Lys 


Lys 


Leu 


Met 


His 


Gin Ala 


Ala Leu Leu Gly Gly Met He Arg 








100 








105 ~ 110 


Asp 


Ala 


His 


His 


Tyr 


Gly 


Trp Glu 


Val Ala Gin Pro Val Gin His Asn 






115 






120 


125 


Trp 


Lys 


Thr 


Met 


Ala 


Glu 


Ala Val 


Gin Asn His Val Lys Ser Leu Asn 
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130 135 140 

Trp Gly His Arg Val Gin Leu Gin Asp Arg Lys Val Lys Tyr Phe Asn 
145 150 155 160 

lie Lys Ala Ser Phe Val Asp Glu His Thr Val Arg Gly Val Asp Lys 

165 170 175 

Gly Gly Lys Ala Thr Leu Leu Ser Ala Glu His lie Val lie Ala Thr 

180 185 190 

Gly Gly Arg Pro Arg Tyr Pro Thr Gin Val Lys Gly Ala Leu Glu Tyr 

195 200 205 

Gly lie Thr Ser Asp Asp lie Phe Trp Leu Lys Glu Ser Pro Gly Lys 

210 215 220 

Thr Leu Val Val Gly Ala Ser Tyr Val Ala Leu Glu Cys Ala Gly Phe 
225 230 235 240 

Leu Thr Gly lie Gly Leu Asp Thr Thr Val Met Met Arg Ser lie Pro 

245 *" 250 255 

Leu Arg Gly Phe Asp Gin Gin Met Ser Ser Leu Val Thr Glu His Met 

260 265 270 

Glu Ser His Gly Thr Gin Phe Leu Lys Gly Cys Val Pro Ser His lie 

275 a 280 ~ 285 

Lys Lys Leu Pro Thr Asn Gin Leu Gin Val Thr Trp Glu Asp His Ala 

290 295 300 

Ser Gly Lys Glu Asp Thr Gly Thr Phe Asp Thr Val Leu Trp Ala lie 
305 310 315 • 320 

Gly Arg Val Pro Glu Thr Arg Thr Leu Asn Leu Glu Lys Ala Gly- lie 

325 330 335 

Ser Thr Asn Pro Lys Asn Gin Lys lie He Val Asp Ala Gin Glu Ala 

340 345 350 

Thr Ser Val Pro His He Tyr Ala He Gly Asp Val Ala Glu Gly Arg 

355 360 365 

Pro Glu Leu Thr Pro Thr Ala He Lys Ala Gly Lys Leu Leu Ala Gin 

370 375 380 

Arg Leu Phe Gly Lys Ser Ser Thr Leu Met Asp Tyr Ser Asn Val Pro 
385 ~ * 390 395 400 

Thr Thr Val Phe Thr Pro Leu Glu Tyr Gly Cys Val Gly Leu Ser Glu 

405 410 415 

Glu Glu Ala Val Ala Leu His Gly Gin Glu His Val Glu Val Tyr His 

420 425 430 

Ala Tyr Tyr Lys Pro Leu Glu Phe Thr Val Ala Asp Arg Asp Ala Ser 

435 " 440 445 

Gin Cys Tyr He Lys Met Val Cys Met Arg Glu Pro Pro Gin Leu Val 

450 * 455 " ~ 460 

Leu Gly Leu His Phe Leu Gly Pro Asn Ala Gly Glu Val Thr Gin Gly 
465 470 475 480 

Phe Ala Leu Gly He Lys Cys Gly Ala Ser Tyr Ala Gin Val Met Gin 

. 485 490 495 

Thr Val Gly He His Pro Thr Cys Ser Glu Glu Val Val Lys Leu His 

500 505 510 

He Ser Lys Arg Ser Gly Leu Glu Pro Thr Val Thr Gly Cys Xaa Gly 
515 " 520 525 



<210> 305 
<211> 520 
<212> PRT 
<213> Mus tnusculus 

<400> 305 

Met Val Ala Ala Leu Arg Gly Pro 

1 5 
Arg Ala Leu Thr Arg Gly Thr Arg 
20 

Gin Gin Ser Phe Asp Leu Leu Val 

35 40 
Ala Cys Ala Lys Glu Ala Ala Gin 

50 * 55 

Asp Tyr Val Glu Pro Ser Pro Arg 
65 70 



Ser Arg Arg Phe Arg Pro Arg Thr 

10 15 
Gly Ala Ala Ser Ala Ala Gly Gly 
25 30 
He Gly Gly Gly Ser Gly Gly Leu 
45 

Leu Gly Lys Lys Val Ala Val Ala 
60 

Gly Thr Lys Trp Gly Leu Gly Gly 
75 ' 80 
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Thr Cys Val Asn Val Gly Cys He Pro Lys Lys Leu Met His Gin Ala 

85 90 95 

Ala Leu Leu Gly Gly Met He Arg Asp Ala His His Tyr Gly Trp Glu 

100 105 110 

Val Ala Gin Pro Val' Gin His Asn Trp Lys Thr Met Ala Glu Ala Val 

115 120 125 

Gin Asn His Val Lys Ser Leu Asn Trp Gly His Arg Val Gin Leu Gin 

130 135 140 

Asp Arg Lys Val Lys Tyr Phe Asn He Lys Ala Ser Phe Val Asp Glu 
145 150 155 160 

His Thr Val Arg Gly Val Asp Lys Gly Gly Lys Ala Thr Leu Leu Ser 

165 170 175 

Ala Glu His He Val He Ala Thr Gly Gly Arg Pro Arg Tyr Pro Thr 

180 185 190 

Gin Val Lys Gly Ala Leu Glu Tyr Gly lie Thr Ser Asp Asp He Phe 

195 200 ~ 205 

Trp Leu Lys Glu Ser Pro Gly Lys Thr Leu Val Val Gly Ala Ser Tyr 

♦ 210 215 220 

Val Ala Leu Glu Cys Ala Gly Phe Leu Thr Gly He Gly Leu Asp Thr 
225 230 235 240 

Thr Val Met Met Arg Ser He Pro Leu Arg Gly Phe Asp Gin Gin Met 

245 250 " 255 

Ser Ser Leu Val Thr Glu His Met Glu Ser His Gly Thr Gin Phe Leu 

260 265 270 

Lys Gly Cys Val Pro Ser His He Lys Lys Leu Pro Thr Asn Gin Leu 

275 280 285 

Gin Val Thr Trp Glu Asp His Ala Ser Gly Lys Glu Asp Thr Gly Thr 

290 295 300 

Phe Asp Thr Val Leu Trp Ala He Gly Arg Val Pro Glu Thr Arg Thr 
305 310 * 315 320 

Leu Asn Leu Glu Lys Ala Gly He Ser Thr Asn Pro Lys Asn Gin Lys 

325 330 ** 335 

He He Val Asp Ala Gin Glu Ala Thr Ser Val Pro His He Tyr Ala 

340 345 350 

He Gly Asp Val Ala Glu Gly Arg Pro Glu Leu Thr Pro Thr Ala He 

355 360 365 

Lys Ala Gly Lys Leu Leu Ala Gin Arg Leu Phe Gly Lys Ser Ser Thr 

370 375 380 

Leu Met Asp Tyr Ser Asn Val Pro Thr Thr Val Phe Thr Pro Leu Glu 
385 390 395 400 

Tyr Gly Cys Val Gly Leu Ser Glu Glu Glu Ala Val Ala Leu His Gly 

405 410 415 

Gin Glu His Val Glu Val Tyr His Ala Tyr Tyr Lys Pro Leu Glu Phe 

420 425 ~ " 430 

Thr Val Ala Asp Arg Asp Ala Ser Gin Cys Tyr He Lys Met Val Cys 

435 440 445 

Met Arg Glu Pro Pro Gin Leu Val Leu Gly Leu His Phe Leu Gly Pro 

450 455 460 

Asn Ala Gly Glu Val Thr Gin .Gly Phe Ala Leu Gly He Lys Cys Gly 
465 470 475 480 

Ala Ser Tyr Ala Gin Val Met Gin Thr Val Gly lie His Pro Thr Cys 

485 490 495 

Ser Glu Glu Val Val Lys Leu His He Ser Lys Arg Ser Gly Leu Glu 

500 , 505 ~ 510 

Pro Thr Val Thr Gly Cys Cys Gly 
515 ' 520 



<210> 306 

<211> 499 

<212> PRT 

<213> Mus musculus 



<400> 306 

Met Asn Gly Ser Lys Asp Pro Pro Gly Ser Tyr Asp Phe Asp Leu He 

15 10 15 

He He Gly Gly Gly Ser Gly Gly Leu Ala Ala Ala Lys Glu Ala Ala 
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20 25 30 

Lys Phe Asp Lys Lys Val Leu Val Leu Asp Phe Val Thr Pro Thr Pro 

35 40 45 

Leu Gly Thr Arg Trp Gly Leu Gly Gly Thr Cys Val Asn Val Gly Cys 

50 55 60 

lie Pro Lys Lys Leu Met His Gin Ala Ala Leu Leu Gly Gin Ala Leu 
65 ~ 70 75 80 

Lys Asp Ser Arg Asn Tyr Gly Trp Lys Val Glu Asp Thr Val Lys His 

85 90 95 

Asp Trp Glu Lys Met Thr Glu Ser Val Gin Ser His lie Gly Ser Leu 

100 105 110 

Asn Trp Gly Tyr Arg Val Ala Leu Arg Glu Lys Lys Val Val Tyr Glu 

115 120 125 

Asn Ala Tyr Gly Arg Phe lie Gly Pro His Arg lie Val Ala Thr Asn 

130 135 140 

Asn Lys Gly Lys Glu Lys lie Tyr Ser Ala Glu Arg Phe Leu lie Ala 
145 * 150 155 ~ 160 

Thr Gly Glu Arg Pro Arg Tyr Leu Gly He Pro Gly Asp Lys Glu Tyr 

165 ^170 175 

Cys He Ser Ser Asp Asp Leu Phe Ser Leu Pro Tyr Cys Pro Gly Lys 

180 185 190 

Thr Leu Val Val Gly Ala Ser Tyr Val Ala Leu Glu Cys Ala Gly Phe 

195 200 205 

Leu Ala Gly He Gly Leu Asp Val Thr Val Met Val Arg Ser He. Leu 

210 215 220 

Leu Arg Gly Phe Asp Gin Asp Met Ala Asn Lys lie Gly Glu His Met 
225 230 235 240 

Glu Glu His Gly He Lys Phe He Arg Gin Phe Val Pro Thr Lys He 

245 250 255 

Glu Gin He Glu Ala Gly Thr Pro Gly Arg Leu Arg Val Thr Ala Gin 

260 265 270 

Ser Thr Asn Ser Glu Glu Thr He Glu Gly Glu Phe Asn Thr Val Leu 

275 280 285 

Leu Ala Val Gly Arg Asp Ser Cys Thr Arg Thr He Gly Leu Glu Thr 

290 295 300 

Val Gly Val Lys He Asn Glu Lys Thr Gly Lys He Pro Val Thr Asp 
305 310 315 320 

Glu Glu Gin Thr Asn Val Pro Tyr He Tyr Ala He Gly Asp He Leu 

325 330 335 

Glu Gly Lys Leu Glu Leu Thr Pro Val Ala He Gin Ala Gly Arg Leu 

340 345 350 

Leu Ala Gin Arg Leu Tyr Gly Gly Ser Asn Val Lys Cys Asp Tyr Asp 

355 360 ~ 365 

Asn Val Pro Thr Thr Val Phe Thr Pro Leu Glu Tyr Gly Cys Cys Gly 

370 375 380 

Leu Ser Glu Glu Lys Ala Val Glu Lys Phe Gly Glu Glu Asn lie Glu 
385 390 ~ 395 400 

Val Tyr His Ser Phe Phe Trp Pro Leu Glu Trp Thr Val Pro Ser Arg 

405 410 415 

Asp Asn Asn Lys Cys Tyr Ala Lys He He Cys Asn Leu Lys Asp Asp 

420 425 430 

Glu Arg Val Val Gly Phe His Val Leu Gly Pro Asn Ala Gly Glu Val 

435 440 445 

Thr Gin Gly Phe Ala Ala Ala Leu Lys Cys Gly Leu Thr Lys Gin Gin 

450 455 ~ 460 

Leu Asp Ser Thr He Gly lie His Pro Val Cys Ala Glu He- Phe Thr 
465 470 475 480 

Thr Leu Ser Val Thr Lys Arg Ser Gly Gly Asp He Leu Gin Ser Gly 
485 490 495 

Cys Cys Gly 



<210> 307 

<211> 497 

<212> PRT 

<213> Rattus 



norvegicus 
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<220> 

<221> VARIANT 
<222> 497 

<223> Xaa = Any Amino Acid 
<400> 307 

Met Asn Asp Ser Lys Asp Ala Pro Lys Ser Tyr Asp Phe Asp Leu lie 

1 5 10 15 

He He Gly Gly Gly Ser Gly Gly Leu Ala Ala Ala Lys Glu Ala Ala 

20 25 30 

Lys Phe Asp Lys Lys Val Met Val Leu Asp Phe Val Thr Pro Thr Pro 

35 40 45 

Leu Gly Thr Asn Gly Gly Leu Gly Gly Thr Cys Val Asn Val Gly Cys 

50 ~ 55 60 

He Pro Lys Lys Leu Met His Gin Ala Ala Leu Leu Gly Gin Ala Leu 
65 70 75 80 

Lys Asp Ser Arg Asn Tyr Gly Trp Lys Leu Glu Asp Thr Val Lys His 

85 90 95 

Asp Trp Glu Lys Met Thr Glu Ser Val Gin Asn His He Gly Ser Leu 

100 105 110 

Asn Trp Gly Tyr Arg Val Ala Leu Arg Glu Lys Lys Val Val Tyr Glu 

115 120 125 

Asn Ala Tyr Gly Lys Phe He Gly Pro His Lys He Met Ala Thr Asn 

130 - 135 " 140 

Asn Lys Gly Lys Glu Lys Val Tyr Ser Ala Glu Arg Phe Leu He Ala 
145 150 155 160 

Thr Gly Glu Arg Pro Arg Tyr Leu Gly He Pro Gly Asp Lys Glu Tyr 

165 170 175 

Cys He Ser Ser Asp Asp Leu Phe Ser Leu Pro Tyr Cys Pro Gly Lys 

180 185 ' 190 

Thr Leu Val Val Gly Ala Ser Tyr Val Ala Leu Glu Cys Ala Gly Phe 

195 200 205 

Leu Ala Gly He Gly Leu Asp Val Thr Val Met Val Arg Ser He Leu 

210 215 220 

Leu Arg Gly Phe Asp Gin Asp Met Ala Asn Lys He Gly Glu His Met 
225 230 235 240 

Glu Glu His Gly He Lys Phe He Arg Gin Phe Val Pro Thr Lys He 

245 250 255 

Glu Gin He Glu Ala Gly Thr Pro Gly Arg Leu Lys Val Thr Ala Lys 

260 265 270 

Ser Thr Asn Ser Glu Glu Thr He Glu Asp Glu Phe Asn Thr Val Leu 

275 280 285 

Leu Ala Val Gly Arg Asp Ser Cys Thr Arg Thr He Gly Leu Glu Thr 

290 295 300 

Val Gly Val Lys He Asn Glu Lys Thr Gly Lys He Pro Val Thr Asp 
305 310 315 320 

Glu Glu Gin Thr Asn Val Pro Tyr He Tyr Ala He Gly Asp lie Leu 

325 330 335 

Glu Gly Lys Leu Glu Leu Thr Pro Val Ala He Gin Ala Gly Arg Leu 

340 345 350 

Leu Ala Gin Arg Leu Tyr Gly Gly Ser Thr Val Lys Cys Asp Tyr Asp 

355 360 365 

Asn Val Pro Thr Thr Val Phe Thr Pro Leu Glu Tyr Gly Cys Cys Gly 

370 375 380 

Leu Ser Glu Glu Lys Ala Val Glu Lys Phe Gly Glu Glu Asn He Glu 
385 390 395 400 

Val Tyr His Ser Phe Phe Trp Pro Leu Glu Trp Thr Val Pro Ser Arg 

405 410 415 

Asp Asn Asn Lys Cys Tyr Ala Lys Val He Cys Asn Leu Lys Asp Asn 

420 425 430 

Glu Arg Val Val Gly Phe His Val Leu Gly Pro Asn Ala Gly Glu Val 

435 440 445 

Thr Gin Gly Phe Ala Ala Ala Leu Lys Cys Gly Leu Thr Lys Gin Gin 

450 455 460 

Leu Asp Ser Thr He Gly He His Pro Val Cys Ala Glu He Phe Thr 
465 470 475 480 

Thr Leu Ser Val Thr Lys Arg Ser Gly Gly Asp He Leu Gin Ser Gly 
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485 490 495 



Xaa 



<210> 308 
<211> 176 
<212> PRT 

<213> Rattus norvegicus 



<400> 308 



Arg 
1 


lie 


His 


Ala 


Gly 


Gly Ala 


Gly 


Ala 


Gly 


Val 


Phe 


5 
He 


Leu Leu 


Ala 








20 








Arg 


Lys 


Leu 


Ser 


Thr 


Met Asn 


Asp 






35 








40 


Asp 


Phe 


Asp 


Leu 


He 


He He 


Gly 




50 








55 




Ala 


Lys 


Glu 


Ala 


Ala 


Lys Phe 


Asp 


65 










70 




Val 


Thr 


Pro 


Thr 


Pro 


Leu Gly 


Thr 










85 




Val 


Asn 


Val Gly 


Cys 


He Pro 


Lys 








100 








Leu 


Gly 


Gin 


Ala 


Leu 


Lys Asp 


Ser 






115 








120 


Asp 


Thr 


Val 


Lys 


His 


Asp Trp 


Glu 




130 








135 




His 


He 


Gly Ser 


Leu 


Asn Trp 


Gly 


145 










150 




Lys 


Val 


Val 


Tyr 


Glu 


Asn Ala 


Tyr 










165 







Arg Arg Arg Gly Gly Ala. Arg Arg 

10 15 
His Pro Asn Lys Lys Gly Leu Leu 
25 30 
Ser Lys Asp Ala Pro Lys Ser Tyr 
45 

Gly Gly Ser Gly Gly Leu Ala Ala 
60 

Lys Lys Val Met Val Leu Asp Phe 

75 80 
Arg Trp Gly Leu Gly Gly Thr Cys 

90 95 
Lys Leu Met His Gin Ala Ala Leu 
105 110 
Arg Asn Tyr Gly Trp Lys Leu Glu 
125 

Lys Met Thr Glu Ser Val Gin Asn 
140 

Tyr Arg Val Ala Leu Arg Glu Lys 
155 ~ 160 

Gly Lys Phe He Gly Pro His Lys 
170 175 



<210> 309 
<211> 498 
<212> PRT 

<213> Rattus norvegicus 
<220> 

<221> VARIANT 
<222> 497 

<223> Xaa = Any Amino Acid 



<400> 309 



Met 


Asn 


Asp 


Ser 


Lys 


Asp 


Ala 


Pro Lys Ser Tyr Asp Phe 


Asp Leu He 


1 








5 






10 


15 


He 


He 


Gly 


Gly 


Gly 


Ser 


Gly 


Gly Leu Ala Ala Ala Lys 


Glu Ala Ala 








20 








25 


30 


Lys 


Phe 


Asp 


Lys 


Lys 


Val 


Met 


Val Leu Asp Phe Val Thr 


Pro Thr Pro 






35 










40 45 




Leu 


Gly 


Thr 


Asn 


Gly 


Gly 


Leu 


Gly Gly Thr Cys Val Asn 


Val Gly Cys 




50 










55 


60 


He 


Pro 


Lys 


Lys 


Leu 


Met 


His 


Gin Ala Ala Leu Leu Gly 


Gin Ala Leu 


65 










70 




75 


80 


Lys 


Asp 


Ser 


Arg 


Asn 


Tyr 


Gly 


Trp Lys Leu Glu Asp Thr 


Val Lys His 










85 






90 


95 


Asp 


Trp 


Glu 


Lys 


Met 


Thr 


Glu 


Ser Val Gin Asn His He 


Gly Ser Leu 








100 








105 


110 


Asn 


Trp 


Gly 


Tyr 


Arg 


Val 


Ala 


Leu Arg Glu Lys Lys Val 


Val Tyr Glu 






115 










120 125 


Asn 


Ala 


Tyr 


Gly 


Lys 


Phe 


He 


Gly Pro His Lys He Met 


Ala Thr Asn 




130 










135 


140 




Asn 


Lys 


Gly 


Lys 


Glu 


Lys 


Val 


Tyr Ser Ala Glu Arg Phe 


Leu He Ala 


145 








150 




155 


160 


Thr 


Gly 


Glu 


Arg 


Pro 


Arg 


Tyr 


Leu Gly lie Pro Gly Asp 


Lys Glu Tyr 
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Cys lie 

Thr Leu 

Leu Ala 
210 
Leu Arg 
225 

Glu Glu 

Glu Gin 

Ser Thr 

Leu Ala 
290 
Val Gly 
305 

Glu Glu 

Glu Gly 

Leu Ala 

Asn Val 
370 
Leu Ser 
385 

Val Tyr 

Asp Asn 

Glu Arg 

Thr Gin 
450 
Asp Ser 
465 

Leu Ser 
Xaa Gly 



Ser Ser 
180 
Val Val 
195 

Gly He 

Gly Phe 

His Gly 

He Glu 
260 
Asn Ser 
275 

Val Gly 

Val Lys 

Gin Thr 

Lys Leu 
340 
Gin Arg 
355 

Pro Thr 

Glu Glu 

His Ser 

Asn Lys 
420 
Val Val 
435 

Ala Leu 
Thr He 
Val Thr 



165 

Asp Asp 

Gly Ala 

Gly Leu 

Asp Gin 
230 
He Lys 
245 

Ala Gly 

Glu Glu 

Arg Asp 

He Asn 
310 
Asn Val 
325 

Glu Leu 

Leu Tyr 

Thr Val 

Lys Ala 
390 
Phe Phe 
405 

Cys Tyr 

Gly Phe 

Gin Pro 

Gly He 
470 
Lys Arg 
485 



Leu Phe 

Ser Tyr 
200 
Asp Val 
215 

Asp Met 

Phe He 

Thr Pro 

Thr He 
280 
Ser Cys 
295 

Glu Lys 

Pro Tyr 

Thr Pro 

Gly Gly 
360 
Phe Thr 
375 

Val Glu 

Trp Pro 

Ala Lys 

His Val 
440 
Leu Lys 
455 

His Pro 



170 
Ser Leu Pro 
185 

Val Ala Leu 

Thr Val Met 

Ala Asn Lys 
235 

Arg Gin Phe 

250 
Gly Arg Leu 
265 

Glu Asp Glu 

Thr Arg Thr 

Thr Gly Lys 
315 

He Tyr Ala 

330 
Val Ala He 
345 

Ser Thr Val 

Pro Leu Glu 

Lys Phe Gly 
395 

Leu Glu Trp 

410 
Val He Cys 
425 

Leu Gly Pro 
Cys Gly Leu 



Val Cys Ala 
475 

Ser Gly Gly Asp He 
490 



Tyr Cys Pro 
190 

Glu Cys Ala 

205 
Val Arg Ser 
220 

He Gly Glu 

Val Pro Thr 

Lys Val Thr 
270 

Phe Asn Thr 

285 
He Gly Leu 
300 

He Pro Val 

He Gly Asp 

Gin Ala Gly 
350 

Lys Cys Asp 

365 
Tyr Gly Cys 
380 

Glu Glu Asn 

Thr Val Pro 

Asn Leu Lys 
430 

Asn Ala Gly 

445 
Thr Lys Gin 
460 

Glu He Phe 
Leu Gin Ser 



175 

Gly Lys 

Gly Phe 

He Leu 

His Met 
240 
Lys He 
255 

Ala Lys 

Val Leu 

Glu Thr 

Thr Asp 
320 
He Leu 
335 

Arg Leu 

Tyr. Asp 

Cys Gly 

He Glu 
400 
Ser Arg 
415 

Asp Asn 

Glu Val 

Gin Leu 

Thr Thr 
480 
Gly Cys 
495 



<210> 310 
<211> 11 
<212> PRT 

<213> Rattus norvegicus 
<400> 310 

Met Asn Asp Ser Lys Asp Ala Pro Lys Ser Tyr 
15 10 



<210> 311 
<211> 496 
<212> PRT 

<213> Rattus norvegicus 
<400> 311 

Met Asn Asp Ser Lys Asp Ala Pro Lys Ser Tyr Asp Phe Asp Leu He 

15 10 15 

He He Gly Gly Gly Ser Gly Gly Leu Ala Ala Ala Lys Glu Ala Ala 

20 25 30 

Lys Phe Asp Lys Lys Val Met Val Leu Asp Phe Val Thr Pro Thr Pro 
35 40 45 



-198- 



WO 02/50289 



PCT/US01/50240 



Leu Gly 


Thr Asn 


Gly 


50 






He Pro 


Lys Lys 


Leu 


65 






Lys Asp 


Ser Arg 


Asn 






85 


Asp Trp 


Glu Lys 


Met 




100 




Asn Trp 


Gly Tyr 


Arg 




115 




Asn Ala 


Tyr Gly 


Lys 


130 






Asn Lys 


Gly Lys 


Glu 


145 






Thr Gly 


Glu Arg 


Pro 






165 


Cys He 


Ser Ser 


Asp 




180 




Thr Leu 


Val Val 


Gly 




195 




Leu Ala 


Gly He 


Gly 


210 






Leu Arg 


Gly Phe 


Asp 


225 






Glu Glu 


His Gly 


He 






245 


Glu Gin 


He Glu 


Ala 




260 




Ser Thr 


Asn Ser 


Glu 




275 




Leu Ala 


Val Gly 


Arg 


290 






Val Gly 


Val Lys 


He 


305 






Glu Glu 


Gin Thr 


Asn 






325 


Glu Gly 


Lys Leu 


Glu 




340 




Leu Ala 


Gin Arg 


Leu 




355 




Asn Val 


Pro Thr 


Thr 


370 






Leu Ser 


Glu Glu 


Lys 


385 




Val Tyr 


His Ser 


Phe 




405 


Asp Asn 


Asn Lys 


Cys 




420 




Glu Arg 


Val Val 


Gly 




435 




Thr Gin 


Ala Leu 


Gin 


450 






Asp Ser 


Thr He 


Gly 


465 






Leu Ser 


Val Thr 


Lys 






485 



Gly 


Leu 


Gly Gly Thr Cys 




55 


Met 


His 


Gin Ala Ala Leu 


70 




75 


Tyr 


Gly 


Trp Lys Leu Glu 






90 


Thr 


Glu 


Ser Val Gin Asn 






105 


Val 


Ala 


Leu Arg Glu Lys 






120 


Phe 


He 


Gly Pro His Lys 




135 


Lys 


Val 


Tyr Ser Ala Glu 


150 




155 


Arg 


Tyr 


Leu Gly He Pro 






170 


Asp 


Leu 


Phe Ser Leu Pro 




185 


Ala 


Ser 


Tyr Val Ala Leu 






200 


Leu 


Asp 


Val Thr Val Met 




215 




Gin 


Asp 


Met Ala Asn Lys 


230 




235 


Lys 


Phe 


He Arg Gin Phe 






250 


Gly 


Thr 


Pro Gly Arg Leu 






265 


Glu 


Thr 


He Glu Asp Glu 






280 


Asp 


Ser 


Cys Thr Arg Thr 




295 




Asn 


Glu 


Lys Thr Gly Lys 


310 




315 


Val 


Pro 


Tyr He Tyr Ala 






330 


Leu 


Thr 


Pro Val Ala He 






345 


Tyr 


Gly 


Gly Ser Thr Val 






360 


Val 


Phe 


Thr Pro Leu Glu 




375 




Ala 


Val 


Glu Lys Phe Gly 


390 




395 


Phe 


Trp 


Pro Leu Glu Trp 






410 


Tyr 


Ala 


Lys Val He Cys 






425 


Phe 


His 


Val Leu Gly Pro 






440 


Pro 


Leu 


Lys Cys Gly Leu 




455 




He 


His 


Pro Val Cys Ala 


470 




475 


Arg 


Ser 


Gly Gly Asp He 



490 



Val Asn Val Gly Cys 
60 

Leu Gly Gin Ala Leu 
80 

Asp Thr Val Lys His 
95 

His He Gly Ser Leu 
110 

Lys Val Val Tyr Glu 
125 

He Met Ala Thr Asn 
140 

Arg Phe Leu He Ala 
160 

Gly Asp Lys Glu Tyr 
175 

Tyr Cys Pro Gly Lys 
190 

Glu Cys Ala Gly Phe 
205 

Val Arg Ser He Leu 
220 

He Gly Glu His Met 
. 240 

Val Pro Thr Lys He 
255 

Lys Val Thr Ala Lys 
270 

Phe Asn Thr Val Leu 
285 

He Gly Leu Glu Thr 
300 

He Pro Val Thr Asp 
320 

He Gly Asp He Leu 
335 

Gin Ala Gly Arg Leu 
350 

Lys Cys Asp Tyr Asp 
365 

Tyr Gly Cys Cys Gly 
3 80 

Glu Glu Asn He Glu 
400 

Thr Val Pro Ser Arg 
415 

Asn Leu Lys Asp Asn 
430 

Asn Ala Gly Glu Val 
445 

Thr Lys Gin Gin Leu 
460 

Glu He Phe Thr Thr 
480 

Leu Gin Ser Gly Cys 

, 495 



<210> 312 
<211> 526 
<212> PRT 

<213> Rattus norvegicus 
<220> 

<221> VARIANT 
<222> 525 

<223> Xaa = Any Amino Acid 
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<400> 312 

Met Ala Ala He Val Ala Ala Leu Arg Gly Ser Ser Gly Arg Phe Arg 

15 10 15 

Pro Gin Thr Arg Val Leu Thr Arg Gly Thr Arg Gly Ala Ala Gly Ala 

20 25 30 

Ala Ser Ala Ala Gly Gly Gin Gin Asn Phe Asp Leu Leu Val He Gly 

35 40 45 

Gly Gly Ser Gly Gly Leu Ala Cys Ala Lys Glu Ala Ala Gin Leu Gly 

50 55 60 

Arg Lys Val Ala Val Ala Asp Tyr Val Glu Pro Ser Pro Arg Gly Thr 
65 70 75 ~ 80 

Lys Trp Gly Leu Gly Gly Thr Cys Val Asn Val Gly Cys He Pro Lys 

85 90 95 

Lys Leu Met His Gin Ala Ala Leu Leu Gly Gly Met He Arg Asp Ala 

100 105 110 

Gin His Tyr Gly Trp Glu Val Ala Gin Pro Val Gin His Asn Trp Lys 
115 ^ 120 125 

Ala Met Ala Glu Ala Val Gin Asn His Val Lys Ser Leu Asn Trp Gly 

130 135 140 

His Arg Val Gin Leu Gin Asp Arg Lys Val Lys Tyr Phe Asn He Lys 
145 150 ~ 155 160 

Ala Ser Phe Val Asn Glu His Thr Val His Gly Val Asp Lys Ala Gly 

165 170 175 

Lys Val Thr Gin Leu Ser Ala Lys His He Val He Ala Thr Gly Gly 

180 185 190 

Arg Pro Lys Tyr Pro Thr Gin Val Lys Gly Ala Leu Glu His Gly He 

195 200 205 

Thr Ser Asp Asp He Phe Trp Leu Lys Glu Ser Pro Gly Lys Thr Leu 

210 215 220 

Val Val Gly Ala Ser Tyr Val Ala Leu Glu Cys Ala Gly Phe Leu Thr 
225 230 235 240 

Gly He Gly Leu Asp Thr Thr Val Met Met Arg Ser Val Pro Leu Arg 

245 250 255 

Gly Phe Asp Gin Gin Met Ala Ser Leu Val Thr Glu His Met Glu Ser 

260 265 270 

His Gly Thr Arg Phe Leu Lys Gly Cys Val Pro Ser Leu He Arg Lys 

275 280 285 

Leu Pro Thr Asn Gin Leu Gin Val Thr Trp Glu Asp Leu Ala Ser Gly 

290 295 300 

Lys Glu Asp Val Gly Thr Phe Asp Thr Val Leu Trp Ala He Gly Arg 
305 310 315 320 

Val Pro Glu Thr Arg Asn Leu Asn Leu Glu Lys Ala Gly Val Asn Thr 

325 330 * 335 

Asn Pro Lys Asn Gin Lys He He Val Asp Ala Gin Glu Ala Thr Ser 

340 345 " 350 

Val Pro His lie Tyr Ala He Gly Asp Val Ala Glu Gly Arg Pro Glu 

355 360 365 

Leu Thr Pro Thr Ala He Lys Ala Gly Lys Leu Leu Ala Gin Arg Leu 

370 375 380 

Phe Gly Lys Ser Ser Thr Leu Met Asn Tyr Ser Asn Val Pro Thr Thr 
385 390 395 400 

Val Phe Thr Pro Leu Glu Tyr Gly Cys Val Gly Leu Ser Glu Glu Glu 

405 410 415 

Ala Val Ala Leu His Gly Gin Glu His He Glu Val Tyr His Ala Tyr 

420 425 430 

Tyr Lys Pro Leu Glu Phe Thr Val Ala Asp Arg Asp Ala Ser Gin Cys 

435 440 ~ 445 

Tyr He Lys Met Val Cys Met Arg Glu Pro Pro Gin Leu Val Leu Gly 

450 455 460 

Leu His Phe Leu Gly Pro Asn Ala Gly Glu Val Thr Gin Gly Phe Ala 
465 470 475 480 

Leu Gly He Gin Cys Gly Ala Ser Tyr Ala Gin Val Met Gin Thr Val 

485 490 495 

Gly He His Pro Thr Cys Ser Glu Glu Val Val Lys Leu His He Ser 

500 505 510 

Lys Arg Ser Gly Leu Asp Pro Thr Val Thr Gly Cys Xaa Gly 
515 520 525 
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<210> 313 
<211> 499 
<212> PRT 
<213> Sus Scrofa 

<220> 

<221> VARIANT 
<222> 498 

<223> Xaa - Any Amino Acid 
<400> 313 

Met Asn Gly Ala Glu Glu Leu Pro Glu Met Tyr Asp Tyr Asp Leu He 

1 5 10 15 

He lie Gly Gly Gly Ser Gly Gly Leu Ala Ala Ala Lys Glu Ala Ala 

20 25 30 

Arg Phe Asn Lys Arg Val Met Val Leu Asp Phe Val Thr Pro Thr Pro 

35 40 45 

Leu Gly Thr Arg Trp Gly Leu Gly Gly Thr Cys Val Asn Val Ser Cys 

50 55 60 

He Pro Lys Lys Leu Met His Gin Ala Ala Leu Leu Gly Gin Ala Leu 
65 70 75 80 

Arg Asp Ser Arg Asn Tyr Gly Trp Asn Val Glu Glu Thr He Lys His 

85 90 95 

Asp Trp Glu Arg Met Thr Glu Ala Val Gin Asn His He Gly Ser Leu 

100 105 110 

Asn Trp Gly Tyr Arg Val Ala Leu Arg Glu Lys Lys Val Thr Tyr Glu 

115 120 125 

Asn Ala Tyr Gly Gin Phe Val Gly Pro His Arg He Lys Ala Thr Asn 

130 " * 135 140 

Asn Lys Gly Lys Glu Lys He Tyr Ser Ala Glu Lys Phe Leu He Ala 
145 * 150 155 160 

Thr Gly Glu Arg Pro Arg Tyr Leu Gly He Pro Gly Asp Lys Glu Tyr 

165 170 175 

Cys He Ser Ser Asp Asp Leu Phe Ser Leu Pro Tyr Cys Pro Gly Lys 

180 " 185 " 190 

Thr Leu Val Val Gly Ala Ser Tyr Val Ala Leu Glu Cys Ala Gly Phe 

195 200 205 

Leu Ala Gly He Gly Leu Asp Val Thr Val Met Val Arg Ser He Leu 

210 215 220 

Leu Arg Gly Phe Asp Gin Asp Met Ala Asn Lys He Gly Glu His Met 
225 230 235 240 

Glu Glu His Gly He Lys Phe He Arg Gin Phe Val Pro He Lys Val 

245 250 255 

Glu Gin He Glu Ala Gly Thr Pro Gly Arg Leu Arg Val Val Ala Gin 

260 265 270 

Ser Thr Asn Ser Glu Glu He He Glu Gly Glu Tyr Asn Thr Val Met 

275 280 285 

Leu Ala He Gly Arg Asp Ala Cys Thr Arg Lys He Gly Leu Glu Thr 

290 295 300 

Val Gly Val Lys He Asn Glu Lys Thr Gly Lys He Pro Val Thr Asp 
305 310 315 320 

Glu Glu Gin Thr Asn Val Pro Tyr He Tyr Ala He Gly Asp He Leu 

325 330 335 

Glu Asp Lys Val Glu Leu Thr Pro Val Ala He Gin Ala Gly Arg Leu 

340 345 350, 

Leu Ala Gin Arg Leu Tyr Ala Gly Ser Thr Val Lys Cys Asp Tyr Glu 

355 ~~ * 360 365 

Asn Val Pro Thr Thr Val Phe Thr Pro Leu Glu Tyr Gly Ala Cys Gly 

370 375 380 

Leu Ser Glu Glu Lys Ala Val Glu Lys Phe Gly Glu Glu Asn He Glu 
385 390 395 400 

Val Tyr His Ser Tyr Phe Trp Pro Leu Glu Trp Thr He Pro Ser Arg 

405 410 415 

Asp Asn Asn Lys Cys Tyr Ala Lys He He Cys Asn Thr Lys Asp Asn 

420 ~ 425 430 

Glu Arg Val Val Gly Phe His Val Leu Gly Pro Asn Ala Gly Glu Val 
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435 440 
Thr Gin Gly Phe Ala Ala Ala Leu 

450 455 
Leu Asp Ser Thr He Gly He His 
465 470 
Thr Leu Ser Val Thr Lys Arg Ser 
485 

Cys Xaa Gly 



445 

Lys Cys Gly Leu Thr -Lys Lys Gin 
460 

Pro Val Cys Ala Glu Val Phe Thr 
475 480 
Gly Ala Ser He Leu Gin Ala Gly 
490 495 
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